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To our ancestors and parents, as always 


Preface 


Robot is hailed as “the pearl at the top of manufacturing crown”, and it is an 
important carrier of a new round of technological revolution and manufacturing 
integration innovation. With the continuous improvement of robot intelligence, 
human-robot integration has become the inevitable development direction of the 
new generation of robots. Human and robot play their own advantages, through 
various environments, operators and robot natural interaction, to complete complex 
work. Robots with the ability of human—computer integration will play an impor- 
tant role in intelligent manufacturing, family service, medical education, space 
exploration and other fields, with a very broad research and application prospects. 

In order to achieve human-robot integration, the ability of human—computer 
cooperation and autonomous operation is required. In tasks such as human—com- 
puter interaction and flexible assembly, the robot interacts with external environ- 
ment directly, which requires the ability to accurately control the interaction force, 
to ensure human-computer friendliness and task execution ability. On the other 
hand, due to the unstructured environment, the workspace of robot is affected by 
human beings and other objects. In order to ensure the safety of the system, the 
robot must adapt to the time-varying space constraints: for instance, when human 
beings enter the working space of robots, robots must be able to avoid human 
beings so as to avoid collision. In the more extreme cases, after the collision 
happened, the robot needs to be compliant with the collision. As to the robot itself, 
restricted by the mechanical structure, driving system and other factors, the robot 
must meet its own behavior constraints such as angle, angular speed limit, efc.; in 
addition, in order to improve the flexibility of the system, the new generation robots 
usually have redundant degrees of freedom, using their own structural character- 
istics to achieve the optimization of specific performance indicators. Therefore, the 
study of robot compliance control under complex space and behavior constraints is 
one of the key technologies to achieve human-robot integration. 

Neural network can simulate the working mechanism of biological neural sys- 
tem, learn from the environment and realize information processing. Among them, 
the dynamic neural network has the characteristics of adaptability, nonlinearity, 
parallelism, distributed storage and so on. It can be used to deal with the complex 
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problems which are difficult to be solved by traditional methods. At present, 
dynamic neural network has made a great progress in deep learning, estimation and 
prediction, image processing, complex system control and other fields. 

In this book, mainly focusing on the safe control of robot manipulators, we 
design dynamic neural network based control schemes for robots with redundant 
Degrees of Freedom (DOFs). The control strategies include adaptive tracking 
control for robots with model uncertainties, compliance control in uncertain envi- 
ronments, obstacle avoidance in dynamic workspaces. The idea for this book on 
solving safe control of robot arms was conceived during the industrial applications 
and the research discussion in the laboratory. Most of the materials in this book are 
derived from the authors’ papers published in journals, such as IEEE Transactions 
on Industrial Electronics. The robots considered in this book include SCARA and 
collaborative robots (such as Kinova JACO, and LBR iiwa). Therefore, the control 
methods developed in this book can be used in real applications after proper 
modification. To make the contents clear and easy to follow, in this book, each part 
(and even each chapter) is written in a relatively self-contained manner. 

This book is divided into the following 6 chapters. 

Chapter 1 In this chapter, an adaptive tracking controller is designed for 
redundant manipulators. Model uncertainties and repeatability are considered. The 
control scheme requires neither joint accelerations nor cartesian velocity, which is 
more suitable in practical engineering. By using pseudo-inverse method, repeata- 
bility is optimized in the null space of the Jacobian, the continuity of joint speed is 
also guaranteed. Future studies will concentrate on the experimental validation 
of the proposed controller. 

Chapter 2 An adaptive kinematic identifier is used to learn kinematic parameters 
online, and a dynamic neural network is presented to solve the redundancy reso- 
lution problem. The interplay of the adaptive online identifier and the neural con- 
troller makes it a coupled system with nonlinearity. Using the Lyapunov theory, the 
global converges of tracking error is theoretically verified. Numerical experiment 
results and comparisons based on a JACO, robot arm illustrate the effectiveness 
of the proposed algorithm and demonstrate advantages over existing ones. The 
Jacobian adaption strategy together with recurrent neural network (RNN) achieves 
task space tracking both in static and dynamic situations. Pseudo-inverse calcula- 
tion of Jacobian matrix is avoided, so that the real-time performance of controller is 
guaranteed. The boundedness of joint speed can also protect the robot and enhance 
the safety performance. Before ending this chapter, it is worth pointing out that this 
is the first kinematic regression based dynamic neural model for self-adaptive 
redundant manipulator motion control, with provable convergence and guaranteed 
performance bounds. 

Chapter 3 In this chapter, we propose an adaptive admittance control method for 
redundant manipulators based on RNN, in which model uncertainties of both 
interaction model and physical parameters are taken into consideration. Theoretical 
derivation using the Lyapunov technique shows the convergence of the proposed 
adaptive RNN, and numerical results on a 7-DOF robot iiwa demonstrate the 
effectiveness of the proposed control strategy. Compared with existing control 
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methods, the proposed controller shows good performance not only in handling 
physical constraints, but also in eliminating the calculation of pseudo-inversion. At 
last, it is remarkable that this is the first time to extend RNN based method to the 
case of force control for redundant manipulators, especially the ones with model 
uncertainties. This study will be of great significance in industrial application such 
as grinding robots, assembling robots, etc. 

Chapter 4 In this chapter, a novel obstacle avoidance strategy is proposed based 
on a deep recurrent neural network. The robots and obstacles are presented by sets 
of critical points, then the distance between the robot and obstacle can be 
approximately described as point-to-points distances. By understanding the nature 
escape velocity methods, a more general description of obstacle avoidance strategy 
is proposed. Using Minimum-Velocity-Norm (MVN) scheme, the obstacle avoid- 
ance together with path tracking problem is formulated as a Quadratic Planning 
(QP) problem, in which physical limits are also considered. By introducing model 
information, a deep RNN with a simple structure is established to solve the QP 
problem online. Simulation results show that the proposed method can realize the 
avoidance of static and dynamic obstacles. 

Chapter 5 In this chapter, a novel collision-free compliance controller is con- 
structed based on the idea of QP programming and neural networks. Different from 
existing methods, in this chapter, the control problem is described from an opti- 
mization perspective, and the compliance control and collision avoidance are for- 
mulated as equality or inequality constraints. The physical constraints such as 
limitations of joint angles and velocities are also taken into consideration. Before 
ending this chapter, it is worth pointing out that it is the first RNN based compliance 
control method, which considers collision avoidance problem in real time, and also 
shows great potential in handling physical limitations. In this chapter, simple 
numerical simulations in MATLAB are carried out to verify the efficiency of the 
proposed controller. In the future, we will check the control framework with dif- 
ferent impedance models in physically realistic simulation environments, and then 
consider machine vision technology and system delay problem on physical 
experimental platforms. 

Chapter 6 This paper focuses on motion—force control problem for redundant 
manipulators, while physical constraints and torque optimization are taken into 
consideration. Firstly, tracking error and contact force are modelled in orthogonal 
spaces, respectively, and then the control problem is turned into a QP problem, 
which is further rewritten in velocity level by rewriting objective function and 
constraints. To handle multiple physical constraints, a RNN based scheme is 
designed to solve the redundancy resolution online. Numerical experiment results 
show the validity of the proposed control scheme. Before ending this paper, it is 
noteworthy that this is the first paper to deal with motion—force control of redundant 
manipulators in the framework of RNNs and redundant manipulators with force 
sensitivity, e.g., grinding robots, can be readily controlled with the proposed RNN 
model but cannot with existing RNN models in this field. 


x Preface 


At the end of this preface, it is worth pointing out that, in this book, some 
distributed methods for the cooperative control of multiple robot arms and their 
applications are provided. The ideas in this book may trigger more studies and 
researches in neural networks and robotics, especially neural network based 
cooperative control of multiple robot arms. There is no doubt that this book can be 
extended. Any comments or suggestions are welcome, and the authors can be 
contacted via e-mail: shuaili@ieee.org (Shuai Li). 
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Chapter 1 A) 
Adaptive Jacobian Based Trajectory sie 
Tracking for Redundant Manipulators 

with Model Uncertainties in Repetitive 

Tasks 


Abstract Tracking control of manipulators, which is also called kinematic control, 
has always been a fundamental problem in robot control, especially for redundant 
robots with higher degrees of freedom. This problem would become more difficult 
for systems with model uncertainties. This chapter presents an adaptive tracking 
controller that considers uncertain physical parameters. Based on the realtime feed- 
back of task-space coordinates, by updating the motion parameters online, a Jacobian 
adaptive control strategy that does not require cartesian velocity and joint acceleration 
is established, which makes the controller much simpler. Then the Jacobian pseudo- 
inverse method is used to obtain the optimal repetitive solution as a secondary task. 
Lyapunov theory is used to prove that the tracking error of the end effector could 
asymptotically converge to zero. Numerical simulations verify the effectiveness of 
the proposed method. 


1.1 Introduction 


Robots have been widely used in industrial, agricultural, aerospace and other fields. 
Therefore, research on robotics, especially robot control technology, has been a hot 
issue in recent decades [1-5]. In order to improve the operation accuracy of the robot, 
tracking control has always been a fundamental problem in robot control, which has 
attracted wide attention from researchers. 

The tracking control of manipulators can be divided into two categories: joint 
space tracking and task space tracking. The target of joint space tracking is to design 
a controller to drive each joint of the robot to track a predetermined trajectory (see, 
for example [6, 7] and references therein). Another direction of tracking control is 
task space tracking, which is to establish the desired trajectory in cartesian space. 
Since the control commands do not match the target (control commands are sent 
to the actuators of each joint, and then the end effector is controlled to execute in 
Cartesian space, the mapping between the two spaces is highly nonlinear), task- 
space tracking is more difficult than joint space. Therefore, we should first solve the 
kinematic inverse problem, that is, obtain the required joint space position or speed, 
and realize task space tracking. This can be done off-line or online. The desired path 
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in cartesian space is discretized into a set of key points, and the corresponding joint 
configurations are determined in turn, and the desired joint velocity and acceleration 
are obtained by interpolation [8]. Similar studies can be seen in [9, 10]. This method 
is currently widely used in industrial applications, but it will have a certain impact 
on the real-time performance of the system. For redundant manipulators, there is 
an infinite section configuration corresponding to a particular Cartesian description. 
Therefore, the second task can be accomplished by adjusting the joints, such as 
avoiding obstacles and optimizing energy consumption. 

With full knowledge of physical parameters, a series of studies on real-time con- 
trollers can be found in [11-14]. In fact, robots usually have model uncertainties, 
including kinematic uncertainties, which may be caused by processing and mea- 
surement errors. On the other hand, robots may work with different tools, which 
can also lead to model uncertainties. The parametric drift may lead to inaccuracy 
in Jacobian, resulting in performance degradation or unpredictable response, which 
should be compensated. Before designing the controller, several calibration methods 
are proposed to determine the exact parameters [15, 16]. With the development of 
optical technology, researchers could measure the exact position and direction of the 
end-effector online. A series of real-time tracking controllers are proposed. Liu et 
al. proposes an adaptive tracking scheme based on online learning of the Jacobian 
matrix, by discussing the selection of control gain in detail, the authors prove the 
stability of the closed-loop system [18]. In [19], a robust controller considering actu- 
ator saturation is designed. Lyapunov theory indicates the semi-global stability of the 
system. In [20], a dynamic regulation controller is also established, which consists of 
a transposed Jacobian operator and a gravity compensator. When the required path 
is variable, Cheah et al. propose a passive tracking controller [21], which proves the 
global convergence of the tracking error. Liu et al. use the fuzzy logic system to 
understand the uncertainty of the robot model, and design a tracking control scheme 
based on sliding mode control. However, these studies require cartesian velocity or 
joint acceleration, which is actually difficult to obtain due to hardware constraints. 
Therefore, Wang et al. propose a tracking controller based on a low-pass filter, which 
omits the cartesian velocity measurement [22]. Similar studies can be seen in [23— 
25]. The above research mainly focuses on the general problems of position control, 
the physical uncertainty of robots and ignores the secondary tasks. 

Based on the above research, this chapter studies the motion control of redundant 
manipulators, in which we take the uncertain kinematic parameters into account. In 
practice, robots are usually scheduled to perform periodic tasks, therefore, we choose 
repeatability as a secondary task. In order to avoid the measurement of velocity and 
joint acceleration in task space, a new adaptive controller is designed, which achieves 
secondary tasks by optimizing the functions in null space of Jacobian matrix. We 
also provide stability analysis and numerical simulations. 

The remainder of this chapter is organized as follows. In the second part, we 
will introduce the basic kinematics of redundant robots and give several important 
properties that will be used in the following sections. In the third part, the proposed 
adaptive controller is discussed in detail, including an adaptive method and repeatable 
optimization of model parameters. The convergence analysis of tracking error is 
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discussed. In Sect. 1.4, we provide examples and numerical simulations to verify 
the effectiveness of the proposed tracking method. Finally, Sect. 1.5 concludes the 
chapter. Before concluding this section, we emphasize the main contributions of this 
chapter as follows: 


This chapter mainly studies the adaptive kinematic control in the presence of model 
uncertainties, which is of great significance in practical engineering. 

In the controller design process, there is no need to measure the task space velocity 
and joint acceleration. Therefore, the controller can be easily applied in actual 
systems. 

The contribution also lies in that the proposed repeatability optimization scheme is 
guaranteed by introducing a variable coefficient, which could ensure the continuity 
of the joint speed. 


1.2 Problem Formulation 


Without loss of generality, the robot manipulator studied in this chapter is selected as 
a serial robot, which is most commonly used in industrial applications. The kinematic 
model of a serial robot manipulator is 


JOE) = x(t), (1.1) 


where 6(t) € R” is the joint angles, and x(t) € R” is the vector describing the posi- 
tion and orientation of the end-effector in cartesian space. f(e): R” > R” is the 
forward kinematic mapping of the robot. f (e) is a nonlinear function. Differentiating 
x(t) with respect to time ¢, the cartesian velocity x(t) is formulated as 


X(t) = J (00), aÊ), (1.2) 


where J(gO(t), ap) = 3 f (O(t), ap)/30 (t) € R”*” is the Jacobian matrix. As to a 
redundant manipulator, n > m. ag € R’ denotes the vector of kinematic parameters, 
also called physical parameters, while in this chapter, mainly refers to length of each 
joint. Therefore, a; is considered as a constant vector. 

The movement of the end-effector J(@(t), apt) consists of two parts: phys- 
ical parameter dependent term and joint angle-speed dependent term, and can be 
described in the linearization-in-parameter form [21]: 


J(O(t), aÔ E) = Yk (0 (t), O(t) ax, (1.3) 


where Y; (O(t), 6 (t)) € R”*! is called kinematic regressor matrix. 
In this chapter, to avoid measuring the task-space velocity, a low-pass filter is used 
as follows 
ytary =Aix, (1.4) 
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where A; is a positive constant and y is the filtered output of the task-space velocity 
with initial value y(0) = 0. Rewriting (1.4) leads to 


y=Aix/(p+A1), (1.5) 


where p is the Laplace variable. 
Combining (1.3) and (1.5), we have 


y = Wiar, Welt) = MYLO, 0)/(1 + p), (1.6) 


where W;(0) = 0. For simplicity, we write J (0), Y} (0, ô) as J and Yz, respectively. 


Remark 1.1 In real applications, a, has two different forms, which correspond to 
different meanings. The first value is the actual value of ag, the other one is the 
nominal value of a;. a; is usually a non-calibrated measurement result of ap, which 
is usually provided by the manufacturer or manual measurement. The real value of a, 
is difficult to obtain in real applications, and a, is generally not the same as its nominal 
value of a;;. Due to assembly errors and long-term operations (such as friction, wear, 
etc.,) besides, robots may operate different tools to perform tasks, which also leads 
to motion uncertainty. In this case, the direct use of a? control method can lead to 
control errors, which is unacceptable in high precision tracking control. 


1.3 Main Results 


In this section, we will show the detailed process of the controller design. Firstly, 
the ideal case where all parameters are known is considered, then the basic idea 
of parameter-updating-in-realtime is designed in the case where parameters are 
unknown, and is then expanded to repeated optimization in null space. Finally, the 
stability of the closed-loop system is discussed. 


1.3.1 Adaptive Tracking Method 


Define the tracking error in Cartesian space as 
e(t) = x(t) — xa(t), (1.7) 
(1) Known parameter case 


When the kinematic parameters a, is perfectly known, the accurate Jacobian matrix 
J can be obtained, therefore, the reference trajectory can be designed as 


H(t) = Xq(t) + kižalt) — kelt) — ki O(t), (1.8) 
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where kı and kz are positive control gains. According to Eq. (1.2), the Eq. (1.8) can be 
reformulated as ¥ (t) = Xq(t) + ky xa(t) — koe(t) — k\ x(t), by calculating the second 
derivative of Eq. (1.7), and substituting Eq. (1.8), we have 


elt) = X(t) — Xa(0) 
= ki xq(t) — ke(t) — ky x(t). (1.9) 
Eq. (1.9) can be rewritten as 
elt) + kilt) + ke(t) = 0, (1.10) 


it is obvious that is e(t) will eventually converge to zero, if kı and kz are Hurwitz. 
Combining Eqs. (1.8) and (1.2), and letting initial joint velocity 0 (0) be 0, one can 
easily derive the corresponding control signals of joint speed as below 


6 =0; + On (1.11a) 
t 

6; =f JÝ [Gia + kia — kze — ki J) — JÖ] dt (1.11b) 
0 

6, =U — Jt J)a, (1.110) 


where J is n-dimensional identity matrix, J? = JT (J JT)™! is the Pseudo-Inverse of 
J, and 6,, is a speed component in the null space of Jacobian, œ can be selected arbitrar- 
ily. It is notable that J 6, = 0, indicating that the speed component in the null space 
has no influence on the movement of end-effector. By getting the time-derivative 
of Eq. (1.2) and substituting Eqs. (1.11), (1.2), (1.7), one can easily verify that the 
error dynamics under kinematic controller Eq. (1.11) is the same as Eq. (refeq10), 
the tracking error will gradually converge to 0. 


Remark 1.2 Equation (1.8) gives fundamental description of reference trajectory in 
the Cartesian space, it is notable that all the required information except J(@, ax) on 
the right side of the equation is easy to obtain. This inspires us to design a similar 
control strategy with the existence of kinematic uncertainties. 


(2) Unknown parameter case 

In this situation, since the exact value of a, is unknown, J is unknown. Therefore, 
in this case, we use J instead of J by replacing a, with its estimation â, and let 
a(0) = aj, then the estimated x(r) is X(t) = JO. By replacing ag by âg, according 
to (1.3), the estimated cartesian speed Â satisfies 


£=JG=Y¥;%(q, Dax, (1.12) 


The modified reference trajectory is thus designed as 


t š 
6(t) = [atta + a + oia kikoe — JO — kze] — (kı + ko) O}dt. (1.13) 
0 
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Since the accurate feedback of cartesian velocity x is unavailable, the derivative of 
tracking error é = x — Xq is also unknown, therefore, we define the alternative value 
of é by using the estimated Cartesian speed x: 


Ak =k —ig = JO — Xa, (1.14) 
then the updating law of kinematic parameters is designed as 
år = ki Y (AR + kie) + ks Yg e — WE (DDW Dâr- y) (1.15) 


where T} is a positive definite diagonal matrix, kı, k2 and k3 are positive control 
gains. 


Remark 1.3 Without loss of generality, the initial value of the estimated kinematic 
parameters can be selected according to the nominal value, which can be obtained 
from handbook or manual measurement. In fact, the adjustment of â; (0) does affect 
the tracking process, which can be verified in the next section. The greater the error 
between 4;(0) and az, the greater the initial simulation error. However, according 
to (1.15), no matter what the exact value of â;(0), the estimated value of a, will 
eventually converge to ag, which can be verified by stability analysis and numerical 
experiments. 


Now, we are ready to offer a theorem about the task-space tracking problem for 
robots with uncertain physical parameters using the proposed adaptive controller as 
below. 


Theorem 1.1 The control error e(t) for a redundant manipulator described by (1.7) 
would globally converge to 0, provided the joint speed controller described as (1.13), 
along with the kinematic adaptation law (1.15). 


Proof Differentiating (1.7) and substituting (1.3) and (1.12), we have 


é=K-K+4K—Hg 
= Yeap — Ykâk +X — ža 
= -Y + Ax, (1.16) 


Taking the time derivative of Ax and combining Eqs. (1.14) and (1.16) derives 


d ž te mes 
gD =f? + JO — ie 
= (kı + ky) ka — kike — kze — ka JÒ — kı FO 


koža — k2 JO — kıkze — kze + kıža — kı (ža + é + Ykãy) 
= =k Ak — kokie — kze — ki Yue — ke, (1.17) 


where p = ap — â represents the difference between the real value of physical 
parameters a; and the estimated one âz. Eq. (1.17) can be written as 
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ds; x 
q4 kie) = —ka(Ax + kie) — kze — kı Vy. (1.18) 


Select a Lyapunov function candidate as follows 
V = (Ax + kie) (Ak + kie)/2 + kze"e/2 + Gf a /2. (1.19) 


By taking the time derivative of (1.19) and combing (1.15), (1.16) and (1.17), we 
have 


V = (Ak + kie) d(AÎ + kje) /dt + kze"ė + a) ay 

= (Ak + kje)" (—ky (AX + kie) — kze — ky Yay) + af (k1 YF (AX + kie) + k3 Yf e 
-WE (T1 (We (tax — y) + kze" (—Ykăk + AX) 

= ky (A + kje) (Ax + kie) — kykgete — af WE (T1 We (tay 

<0. (1.20) 


Then we can obtain that Ax, e and ã are all bounded. Based on Eqs. (1.14) and 
(1.3), J6, a and Y,a, are also bounded. Notably that W;,(t)d, is the output of a 
stable system with bounded input Y;(t)a,, we have W;,(t)a, is also bounded. Then 
according to Eq. (1.15), a; is bounded. Differentiating W;,(t)da, with respect to time, 
we have 


d : 
q WO4 = Ai (Yk — Wet) ae + Wet) ak. (1.21) 


d(W, (t)a,)/dt is also bounded. Then we have è, d(Ax)/dt and d(W, (t)a,)/dt are all 
bounded, which means the time derivative of (1.20), V is bounded. Using Barbalat’s 
Lemma, we have Ax + kie ~ 0,e > O,ast > œ. 


Remark 1.4 We have proved the convergence of the tracking error under the con- 
dition of kinematic uncertainties. In fact, when ax is perfectly known, Eq. (1.13) will 
be degenerated as 


t 
A(t) = f [Ji ig + (ky + ko)ža — kikze — JG — kze) — (ky + ka)ôldt, (1.22) 
0 


which has the similar form compared with Eq. (1.11). Therefore, known parameter 
case described in Eq. (1.11) can be considered as a special form of Eq. (1.13). 


Remark 1.5 The control velocity g in Eq. (1.13) is not the final result of this chapter. 
The velocity component in null space is ignored, although it has no effect on the 
movement of end-effector as well as the stability proof, this part can not be neglected, 
because the redundancy mechanism is of great engineering significance to the manip- 
ulator. 
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Algorithm 1 The proposed tracking method 


Input: Parameters kı, k2, k3, K, I, €, initial states 6(0) = 0, 6(0), nominal kinematic parame- 
ter 4 (0), desired path xq(t), Xq(t) and ¥ą(t), task duration T,, feedback of end effector x(t), 
analytical expressions of estimated Jacobian matrix J and kinematic regressor matrix Yz. 

Output: To achieve task-space tracking of the redundant manipulator 

Initialize ag (0) < aj. 

x, 0, q < Sensor readings 

Calculate e, y, and Wz (t) by Equation (1.7), (1.5) and (1.6) 

Update K by Equation (1.27) 

Update 6; by 6; < Equation (1.26b) 

Update 6, by 6,, < Equation (1.26c) 

calculate the output 6 by Equation (1.26a) 

. Update a, by a, < by Equation (1.26d) 

Until > T.) 


SIAR NS 


(3) Repeatability optimization 
In this subsection, in order to make full use the of redundant design of a redundant 
manipulator, a repeatability optimization scheme is developed in the null space of 
the Jacobian matrix, which is helpful to improve the stability and reliability of robots 
in periodic tasks. 

Define a following function to describe a robot’s repeatability as 


F(0) = —K (8 — Gini)’ (0 — Oni) /2, (1.23) 
where K is a positive parameter scaling the weight of repeatability optimization, 6; n; 


is the initial value of the joint angles. By using gradient descent method, a velocity 
component in null space can be calculated as 


a = [Ə (F (0))/3 (01), +++ , 0(F@))/9Gn)].- (1.24) 
Combining Eqs. (1.24) and (1.23), we have 
& = [Oin (1) — 01), + , Oin (i) — OC), +++ p Oin) — ONT. (1.25) 
where Oini (i) and 0 (i) represent the ith element of 6;,,; and 0, respectively, i = 
lees yn. 


Then the complete form of the proposed adaptive controller is 


6 =6; + (1.26a) 


t 
y= | U'Gat te + koia kykoe — JO — kze) — (kı + kx) ]dt (1.26b) 
0 


Bn =U — ITI) int) — OCD), -++ ine (i) — OG), +++ y Oin (n) — O(n)" (1.260) 
ay =k YE (AS + kie) + k3 Yl e — WE OM (Wea — y) (1.26d) 
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Fig. 1.1 change curve of K with time t 


Remarkable that at the beginning stage of the tracking cycle, the repeatability is 
less important, and then it rises as the task continues. To this end, we set K as a 


variable: 
0 NT <t < NT +T/2, 


= 2NT +T 1.27 
K* <tr ba 


K 
where K* = Kmax(1 — cos(a (t — NT — T/2)/T), N =0,1,2,... are natural 
numbers, T is the period of cyclic motion. If t < NT + T/2, the robot has just 
left the initial state to perform a task, thus we let K = 0, this will cause a = 0, the 
joint control velocity is the same as (1.13). Whent > NT + T/2, K increases from 
0 to maximum value Kj, with time, forcing the robot to repeat the initial state. The 
change curve of K with time is shown in Fig. 1.1. 


Remark 1.6 The main reason for this selection of K is to ensure the continuity 
of joint speed signals during a motion cycle. Notable that the discontinuities of K 
still appear at the moment T = NT. If the robot can repeat the initial joint state, 
0 — ini would converge to 0, so œ can be also regarded as continuous. Therefore, 
the definition of K in (1.27) is acceptable. 


1.4 Numeral Simulations 


In this section, several groups of numerical experiments are carried out to show the 
effectiveness of the designed controller. Firstly, a comparative simulation is given 
to show that the adaptive tracking law could achieve a satisfing performance with 
the existence of kinematic uncertainties. Secondly, we will check the performance 
in periodic tasks. Finally, more general trajectories are discussed to show the adap- 
tiveness and robustness of the control algorithm. 
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Joint a, a d, 0 
l 0.3 m 0 rad 0m Orad 


2 
3 0.1 m 0 rad 0m Orad 
4 0.2 m 0 rad Om Orad 


Fig. 1.2 The 4-DOF redundant manipulator to be simulated in this chapter. Left: Physical structure 
of the 4-link robot manipulator. Right: D-H parameters 


1.4.1 Simulation Settings 


The vector of initial joint angles is selected as ini = [7 /2, —7x /2, 0, O]'rad, and 
the corresponding cartesian position is Xini = [0.6, 0.3]'. Since the exact value of 
kinematic parameters (see d; in Fig. 1.2), we assume the nominal values to be a; = 
[0.25, 0.25, 0.12, 0.18]"m, and let â (0) = aç. The control gains kı, kz and k; are set 
to be ky = 50, k2 = 50, k3 = 50, I’ = 10. As to the repeatable tasks, the parameter 
scaling the velocity component in the null space is selected as Kmax = 10. The time 
constant of low-pass filter is à = 40. It is notable that matrix J is essential in the 
proposed tracking controller, which is used to estimate the actual Jacobian matrix 
J (q, ax). To further show the detail of the proposed controller, analytical expression 
of J is given in Appendix. 


1.4.2 Verification of Parameter Estimation 


Comparative simulations is firstly carried out to show the effectiveness of the pro- 
posed updating law (1.15). The desired path to be tracked is defined as xg(t) = 
0.4 + 0.2cos (2t), ya(t) = 0.3 + 0.2sin(2r). In the first simulation, the nominal val- 
ues are used directly in the tracking control according to Eq. (1.13). By contrast, a, 
is updated using (1.15) in the comparable simulation, and a is set to be zero (i.e., 
we didn’t use repeatability in this part). Simulation results are shown in Fig. 1.3. 
Both controllers ensure the boundedness of the tracking error. When a; is known, 
benefiting from the closed-loop control mechanism, the tracking errors along two 
axes are much less than 5 mm. The tracking errors with parameter estimation are less 
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Fig. 1.3 Error profile with and without parameter estimation when tracking a circle. a Track- 
ing errors without parameter estimation. b Tracking errors with parameter estimation. c Norm of 
tracking errors with and without parameter estimation 


than | mm. Fig. 1.3c shows comparative results of tracking error norm correspond- 
ing to known and unknown ag, intuitively showing the effectiveness of the proposed 
controller under the condition of unknown models. 


1.4.3 Verification of Repeatability Optimization 


Then we check the effectiveness of repeatability optimization. Based on the simula- 
tion of the previous part, we introduce the proposed repeatability optimization scheme 
(i.e., the controller is the same as the adaptive tracking controller in the previous part 
except a Æ 0.) Simulation results are shown in Fig. 1.4. The curve of tracking error 
e is the same as the one when a = 0, showing the property that the velocity com- 
ponent in null space have no influence on the cartesian movement (Fig. 1.4a). The 
estimated kinematic parameters â; are shown in Fig. 1.4b, which slowly converges to 
ap with time. The error norm ||Ypâk — X||2 of the estimated cartesian speed reduced 
to zero quickly, as shown in Fig. 1.4c. The curve of the repeatability function is 
shown in Fig. 1.4d, we can observe that when t = T, 27,---, ||¢ — dini||2z equals to 
zero. This is to say, when repeatability optimization is used, ||g — gini||2 changes 
periodically. Figure 1.4e shows the motion trajectory tracked by its end-effector of 
the robot manipulator, illustrating the precise tracking of the circle desired trajectory. 


1.4.4 Cardioid Tracking 


To further verify effectiveness of the proposed control scheme, the manipulator is 
required to track a cardioid trajectory in 2-D workspace. The desired path is defined as 
xa(t) = 0.1 (2sin(2t) — sin(4t)) + 0.6 m, ya(t) = 0.1 (2cos (2t) — cos (4t)) + 0.2 
m. Simulation results are shown in Fig. 1.5. The motion trajectory achieved by its 
end-effector of the manipulator is shown in Fig. 1.5a. The corresponding tracking 
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Fig. 1.4 Simulation results with parameter estimation when tracking a circle. a Tracking error 
profile. b Estimated parameter âx. c Difference between the estimated value Ya, and the real one. 
d ||q — qini ||2 with repeatability optimization. e Motion trajectory 


errors are given in Fig. 1.5b, showing that the robot successfully tracks the given 
trajectory. ||¢ — qinill2 is guaranteed to 0 when t = T, 2T, 3T (Fig. 1.5e), and the 
estimated kinematic parameters are shown in Fig. 1.5c. All in all, the proposed con- 
troller ensures stable tracking under the condition of model uncertainties, and the 
repeatability is also achieved. 


1.5 Summary 


In this chapter, an adaptive tracking controller is designed for redundant manipulators. 
Model uncertainties and repeatability are considered. The control scheme requires 
neither joint accelerations nor cartesian velocity, which is more suitable in practical 
engineering. By using the pseudo-inverse method, repeatability is optimized in the 
null space of Jacobian, the continuous of joint speed is also guaranteed. Future studies 
will concentrate on the experimental validation of the proposed controller. 
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Fig. 1.5 Simulation results when tracking a cardioid curve. a Motion trajectory of the manipulator. 
b Tracking error. c Estimated parameter âx. d Difference between the estimated value Y;,a, and the 
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Appendix 


Given the joint angle 6 = [01, 62, 03, 64]' and the estimated a = [â (1), a (2), 
âr (3), âk (4)]'. By simplifying cos(6;) = ci, sin(6;) = si, âk (i) = aj, the analyti- 
cal expression of J is given as below. 


J (1, 1) = —a151 — a2812 — 435123 — 4481234, 
J(1, 2) = —a2512 — 438123 — 4451234 
J (1, 3) = —a38123 — 4451234 


J (1, 4) = —a451234 
J (2, 1) = ayes + a2C12 + 43C123 + 4c 1234 
J (2, 2) = agei2 + a3€123 + 44C1234 
J (2, 3) = a3ci23 + a4ci234 
J (2, 4) = agcyo34. ; 
Based on the analytical expression of J given above, J can be formulated as 
follows. 
J(1, 1) = —a1c101 — az2c12(091 + 62) — a3c123(01 + 62 + 03) 
; — a4c1234(01 + 62 + 03 + 4) 
J (1, 2) = —azc12(01 + 62) — a3c123(A1 + 02 + 03) — a4c1234(01 + 02 + 03 + 04) 
J (1, 3) = —a3c123 (01 + 02 + 03) — a4c1234(01 + 02 + 03 + 04) 
J (1, 4) = —a4c1234(01 + 2 + 63 + 04) 
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J (2, 1) = —a516) — asn Ê + 62) - a38173(01 + 62 + 63) 

: — 4451234(O1 + 02 + 03 + 64) 

fo, 2) = —a2512(6; + 62) — a35123(6) + Ês + 63) — 4451234(01 + Ô> + 63 + 4) 
F(2, 3) = —a35123(61 + 62 + 63) — a4s1234(01 + 62 + 63 + ĝa) 

JF (2,4) = —aasyr34(61 + 62 + 63 + 64). 
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Chapter 2 A) 
RNN Based Trajectory Control for PRE 
Manipulators with Uncertain Kinematic 
Parameters 


Abstract Among tracking control of redundant manipulators, potential limitations 
such as model uncertainties and physical limitations may exist. Conventional solu- 
tions may fail when model parameters differ from nominal ones. In this chapter, 
a novel kinematic controller with the capability of self-adaptation is proposed 
to address this challenging issue. Based on the coordinate feedback, a Jacobian 
adaption strategy is firstly built by updating kinematic parameters online. Using 
Karush-Kuhn-Tucker conditions, the redundancy solution problem is then turned 
into a quadratic optimization one, and a recurrent neural network based controller 
is designed to derive the optimal solution recurrently. Theoretical analysis demon- 
strates the global convergence of the tracking error. Compared with existing methods, 
kinematic model uncertainty of the robot is allowed in this chapter, and the pseudo- 
inverse of Jacobian matrix is avoided, with the consideration of physical limitation 
in a joint framework. Numerical experiments based on Kinova JACO, show the 
effectiveness of the proposed controller. 


2.1 Introduction 


With the development of mechanics, electronics and computer technology advance, 
using robot manipulators is becoming popular in modern manufacturing such as 
welding, painting, assembling, etc [1—4]. Among these applications, tracking control 
of manipulators, focusing on the calculation of control actions to drive the robot to 
move along the user-defined trajectory in Cartesian space, is always a core problem 
in robot control, and has been studied intensively by researchers in recent decades. 

Redundant manipulators have more degrees of freedom (DOFs) than those 
required to accomplish a given task [5], and have shown great potentials in enhancing 
robot flexibility, dexterity, and versatility, avoiding obstacles [6-9], and optimizing 
energy consumption [10]. However, the nonlinear function description from the joint 
to Cartesian space, as well as the redundancy in DOFs, makes it a challenging prob- 
lem to achieve precise tracking control of redundant manipulators. 

In recent decades, some results on resolving the redundancy of manipulators have 
been reported. In most approaches, the problem is solved at the velocity or accelera- 
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tion level, namely, to derive the corresponding joint velocity or acceleration according 
to the trajectory description in Cartesian space. Masayuki et al. [12] propose a redun- 
dancy solution method for a S-R-S redundant manipulator at the angle level, in which 
analytic solutions are firstly derived, and analytical methods for joint avoidance is 
then considered. However, this method is effective only for robots with a specific con- 
figuration and is not scalable to manipulators with a general mechanical structure. To 
solve the kinematic control problem with general configurations, some controllers are 
proposed, including adaptive control methods [13, 14], barrier-Lyapunov-function 
based methods [15, 16], and Jacobian-matrix-pseudo-inverse (JMPI) methods [17- 
19]. In [17], an asymmetric barrier Lyapunov function-based method is introduced 
to handle the output limitation. This method consists of a full state feedback con- 
troller and an output feedback controller. Using the JMPI method, one can get the 
control signals in joint space according to the desired path and the pseudo-inverse 
of the Jacobian matrix. For a redundant manipulator, there is a null space for the 
Jacobian matrix [20], which is helpful to design controllers considering a secondary 
task. Therefore, JMPI based methods have been widely used in redundancy solutions. 
Galicki [21] proposes a JMPI based tracking controller, and an alternative method 
around the singular point is discussed. In [22], a weighted damped least-squares 
method is developed to calculate pseudo-inverse around the singularity, an appro- 
priate damping factor is derived according to the minimum singular value. In [23], 
pseudo-inverse of Jacobian is calculated online by a Taylor-type discrete-time neural 
network, which is composed of T-ZNN-K and T-ZNN-U models. In [24], a special 
type of nonlinear function based neural network is designed for tracking control of a 
PA10 manipulator, and the finite-time convergence of tracking error is also analyzed. 

Although the above-mentioned methods have achieved great success, those meth- 
ods are afflicted with several major limitations in scenarios that require higher per- 
formance of real-time ability, accuracy, and self-adaptation. Firstly, the precise kine- 
matic parameters are required in existing works. Describing the mapping from the 
joint movement in joint space to the movement of the end-effector in Cartesian 
space, the Jacobian matrix contains kinematic characteristics, such as configuration 
and kinematic parameters. For a specified robot, the configuration can be derived, but 
it is usually difficult to obtain accurate kinematic parameters. For example, because 
of the manufacturing error, different operation tools, etc., the DH parameters may 
differ to the reference ones in official guidebooks [25]. In this case, the Jacobian 
matrix based on the inaccurate parameters would cause errors in pseudo-inverse cal- 
culation and even instability of the system [26]. On the other hand, the calculation of 
pseudo-inverse operation is time-consuming, which would lead to a huge cost in real 
applications which requires pseudo-inverse calculation in every control cycle. Addi- 
tionally, due to mechanical reasons, the robot manipulator is suffered from physical 
constraints, such as joint angular and speed limitations. 

In terms of the kinematic control in the presence of model uncertainties, the real- 
time feedback of the end-effector enables closed-loop control for researchers. This 
can be done by high performance measuring devices such as high precision cameras 
and laser trackers [27]. In [28], based on the parameter linearization property, a 
robust controller is proposed, which shows semi-global stability in regulation control 
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in fix-point control. As to the tracking control of manipulators, Hou proposes a 
neural network based control strategy, in which the position/orientation of the robot 
is described by a unit quaternion, and the network is used to learn the unknown 
nonlinear part of the system. One main contribution of the research is that singularities 
associated with three-parameter representation can be avoided. Cheah et al. propose 
several adaptive controllers for manipulators in different industrial applications, such 
as visual tracking, force tracking and trajectory tracking [30-34]. In [35], Chen and 
Zhang design a new adaptive controller in the acceleration level, the basic idea is 
that the Jacobian matrix is updated in realtime rather than kinematic parameters. 
One major drawback of the strategy is that the controller requires the actual values 
of end velocity and acceleration, which may contain noise in actual applications. In 
order to reduce the influence of noise in sensor feedback, Wang introduces a low- 
pass filter, and then an adaptive torque calculation controller is designed in the inner 
loop [36]. Xu develops a modified controller [37], in which the joint command is 
deigned at the acceleration level. It is verified that the controller does not require 
measurement of end velocity and joint acceleration. The influence of the control 
parameters on tracking errors and convergence rate is also discussed. The above 
methods mainly focus on the uncertain model parameters, and the redundancy of the 
manipulators is not considered. Despite the pseudo-inverse can be used instead of 
traditional inverse calculation of Jacobian matrix, the disadvantage of JMPI methods 
remains unresolved. In order to overcome the limitations based on the JMPI method, 
researches transform this problem into a quadratic programming problem, with the 
aim of obtaining an optimal solution with the specified evaluation index under the 
physical constraint. Physical constraints can be formulated into equation constraints 
or inequality constraints. Zhang et al. [38] develops a dual neural network to solve 
quadratic programming problems, and it is shown that this strategy is suitable for 
redundancy solutions. Based on the idea, a series of research is reported in eliminating 
the position error accumulation [39], nonconvex optimization [40], acceleration-level 
compliance [41], parallel robots [42] and multiple robot systems [43]. 

Inspired by the above literature, in this chapter, we focus on the adaptive tracking 
problem for redundant manipulators. The remained of this chapter is arranged as 
below. In Sect. 2.2, fundamental robot kinematics together with useful properties are 
given, we also show the control objective. In Sect. 2.3, an adaptive Jacobian method 
is designed by updating kinematic parameters online, and a RNN is used to achieve 
redundant resolution based on the estimated Jacobian matrix, convergence analysis of 
the tracking error in Cartesian space is also discussed. In Sect. 2.4, numerical results 
and comparisons are conducted on a 6DOF robot JACO>. Finally, conclusions are 
drawn in Sect. 2.5. Before ending the introductory section, we highlight the main 
contributions of this chapter as below: 


e As far as we know, for the first time, this chapter proposes an RNN based controller 
via kinematic regressing for redundant manipulator subject to model uncertainties. 

e Using the Lyapunov theory, the convergence of tracking error is proved in the case 
of uncertain parameters. 
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e In this chapter, there is no need to calculate the pseudo-inverse of the Jacobian 
matrix, which can avoid singularity problems effectively, and also reduce compu- 
tational load. 

e The minimum velocity solution is derived by the proposed control scheme, thus 
the safety of both humans and robots can be also guaranteed. 


2.2 Problem Formulation and Existing Results 


2.2.1 Robot Kinematics 


Without loss of generality, we consider serial robot manipulators in this chapter. The 
kinematic model for robot manipulators is described as follows: 


SOA) = x(t), (2.1) 


where 0 (t) € R” represents the vector of the joint angles at time t, and x(t) € R” 
represents the Cartesian coordinate vector of the end effector. For a specific robot 
manipulator, f (e) : R” —> R” is used to describe the forward kinematics from joint 
space to Cartesian space, which is a continuous nonlinear mapping containing kine- 
matic parameters and structure information. 

By differentiating x(t) with respect to time t, we can get the relationship between 
Cartesian velocity x(t) € R” and joint velocity (or joint control signal) 6(t) € R” 
as follows: 

J (A(t), ak)O (t) = x(t), (2.2) 


with J (O(t), ak) = Of (O(t), ag)/d0(t) being the Jacobian matrix, and a € R' 
denotes the vector of kinematic parameters. 

Once the physical structure of manipulator is determined, its kinematic equation 
(2.2) satisfies the following linearization property [43], which describes the relation- 
ship between the robot’s end velocity and its kinematic parameters: 


J(O(t), aÔ E) = YL (0 (t), O(t))ax, (2.3) 


where Y} (0 (t), 6(t)) €e R”*! is called kinematic regressor matrix. Remarkable that 
Y;,(0(t), 0 (t)) is the function of 0 (t) and 0 (t), and has no relation with ag. 


2.2.2 Control Objective 


In this chapter, we consider the task space tracking problem for redundant manipula- 
tors, where precise values of kinematic parameters are unavailable. The measurable 
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states are joint angles 0 (t) and the end coordinates x(t). The desired Cartesian path 
xa(t) € R” and its time derivative x(t) are accessible, both xg(t) and xq(t) are 
bounded. The nominal value of the kinematic parameter vector is also available, 
which is denoted as ağ. 

The control objective is to generate joint velocity command in real-time, e.g., 
designing Ô) to drive the end-effector of a redundant robot to track xa(t) , in the 
sense that f (0 (t)) = x(t) > xa(t). During the ka tracking process, velocity of 
every joint Ê; (t) should not exceed its limits 6/ 


min? Oh axe 


2.3 Main Results 


In this section, we will show the detailed process of the controller design. When con- 
trolling aredundant robot, one important problem is to make use of its flexibility, such 
as avoiding obstacles, optimizing energy consumption, and avoiding Singularities, 
etc. In this chapter, when the kinematic controller is designed to achieve task-space 
tracking in the presence of physical uncertainties, we consider the energy-saving 
problem at speed level. Therefore, the secondary task in set to minimize the veloc- 
ity norm u'u. The control strategy consists of three parts: a position controller in 
the outer-loop, a Jacobian adaption part which is capable of handling kinematic 
uncertainties online, and an RNN which is used to solve the redundancy resolution 
problem. The stability of the closed-loop system will be also discussed. 


2.3.1 Position Controller 


Firstly, a precise measurement of actual coordinate x in real-time ż is taken to build the 
closed-loop system. The difference between the desired path and the corresponding 
feedback can be defined as 


elt) = xa(t) — x(t). (2.4) 


In order to make e(t) converge to 0, by using the zeroing dynamics [53], the derivative 
of e(t) is designed as 
e(t) = —ke(t), (2.5) 


with k > 0 being a positive constant scaling the convergence rate of e(t). Combining 
(2.4) and (2.5) yields 
X(t) = Xa(t) + kalt) — x (t)). (2.6) 


Let A(t) = u(t), according to (2.6), if u(t) is properly designed to make the robot’s 
end-effector move at a speed of x(t), in the sense that x(t) = J(@(t), ap)u(t), the 
tracking error e(t) in task-space would convergence to zero exponentially. 
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When a; is unknown, the precise Jacobian matrix described in (2.2) is unavailable. 
The redundancy solution can not be achieved by using J (6, az). Therefore, we use the 
estimate Jacobian J (6 (t), a) by replacing the unknown parameters a, in J (0 (t), ax) 
with its estimate a, and the initial value of a; is set as a, (0) = aj, the estimate error 
is defined as a, = ay — ax. Using J(6(t), â) and the control signal A(t), we can 
estimate the velocity of the end-effector as 


X(t) = IOO), âk (t))u(t). (2.7) 


Remarkable that the linearization property described in (2.3) still holds for estimated 
ak: 
J (OCE), ADUE) = YO), uO) )ax (0), (2.8) 


this property will be used in the following stability proof. The adaptive Jacobian 
method by updating its kinematic parameters âx is thus developed as 


a(t) = -DYF (0(t), uet), (2.9) 


where I, € R% is a diagonal positive definite matrix, e(t) is the tracking error 
in Cartesian space as defined in (2.4), and u(t) is the bounded joint speed vector 
satisfying J(6(t), a)u(t) = x(t), which will be designed later. Unless otherwise 
specified, J (0 (t), a) is simplified as J. 


Remark 2.1 Figure 2.1 gives a brief framework of the proposed control scheme for 
redundant manipulators with uncertain kinematic parameters. The desired trajectory 
of the end-effector is specified by xa(t) and xq(t). The desired trajectory together 


Outer-loop 
Controller 


Fig. 2.1 Framework of the proposed scheme for redundant manipulators with uncertain kinematics, 
in which the neural control algorithm includes three interactive modules, i.e., position control 
module, parameter identification module, and redundancy resolution module 
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with feedback x(t) are fed into the position controller (2.6). The tracking error e(t) 
and joint speed (t) are used to learn the kinematic parameters online by identifier 
(2.9). According to the output of the position controller, identified parameter âz, 
feedback of the manipulator and physical limits, an RNN based controller is used to 
achieve the redundancy solution problem. 


2.3.2 Redundant Solution Using RNN 


In this subsection, we focus on redundancy resolution problem based on Jacobian 
adaption method introduced in Sect. 2.3.1. The main purpose of redundancy res- 
olution is to find an optimal joint speed u(t) to make equation J(@(t), a,)u(t) = 
Xa(t) + kle(t)|’sgn(e(t)) hold, at the same time, a secondary task can be also 
achieved. The redundancy resolution problem can be converted into a quadratic 
optimization one with specified constraints. To minimize the kinetic energy of the 
robot, we select velocity norm uTu = 676 as an object function to be optimized, the 
joint range Olin < 6; < 6/,,, and joint speed limits Öl in <Â < Òl ox are regarded as 
inequality constrains. Because J (0, ag) is unavailable, we use J instead of J (0, ax), 
and rewrite x = bo. Then the redundancy resolution problem is reformulated as the 
following quadratic optimization formulations: 


min u'u, (2.10a) 
s.t. bo = Ju, (2.10b) 
HER, (2.10c) 


where 22 = {u € R” už in <ui < Ual is a convex set describing the physical 
constraints, where u’, = max{a (0; 


min min 0i), rinks Meas = min{a Ohrax — 9), haxh 
a > 0 is a positive constant. The convex set ensures the boundedness of both joint 
angles and speed [44]. According to the Karush—Kuhn—Tucker condition [45], an 
equivalent description of the optimal solution to the quadratic optimization as shown 


in (2.10) is described as 


u = Po(u—aL/au), (2.1 1a) 
bo = Ju, (2.11b) 


where Po(e) is a projection operation to the set 2, Po(x) = argmin,.g||y — x||, 
and L = L(u, À) isa Lagrange function defined as L (u, 4) = uTu/2 + AT (bo — Ju), 
where à € R” is a Lagrange multiplier corresponding to the equality constraint. 

Note that the difference between J and J would lead to extra error, which will 
result in tracking failure. To resolve the quadratic optimization problem (2.11), we 
are ready to present the RNN based controller together with the updating kinematic 
parameters online: 
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Algorithm 1 The proposed tracking method 


Input: Parameters k, a, J”, €, joint angle limits Olo 6! as joint speed limits OE ans (e initial states 
u(0), 0 (0), nominal kinematic parameter â (0), desired path xa (t), xa (t), task duration T, feedback 
of end effector x(t), analytical expressions of estimated Jacobian matrix J and kinematic regressor 
matrix Yz. 
Output: To achieve task-space tracking of the redundant manipulator 

1: Initialize 4(0), ax (0) <— ay. 

2: Repeat 

H x, 0 < Sensor readings 

Calculate tracking error e < Equation (2.4) 
Position loop controller bọ <— Equation (2.6) 
Update control output u by ù < Equation (2.12a) 
Update state variable A by À < Equation (2.12b) 

8: Update model parameter a, by âk <— Equation (2.12c) 
Until (t > T) 


SEON- EUD: 


eù = —u + Pe(— JTA), (2.12a) 
ch = Ju — bo, (2.12b) 
åk = -N YT (0, we, (2.120) 


where ¢ is a positive factor scaling the convergence of RNN. The proposed control 
scheme is shown in Algorithm 2.3.2. 


Remark 2.2 It is worth pointing that although the proposed RNN in (2.12a) and 
(2.12b) looks similar to existing ones (e. g., [46, 47]), the modification is very mean- 
ingful. The proposed RNN is capable of handling kinematic uncertainties. When the 
kinematic parameters a, in known, J is equal to J, (2.12a) and (2.12b) have the 
same expression with traditional ones, which shows that a known parameter case is 
only a special form of our control scheme, thus the proposed RNN is more general. 
The proposed control scheme offers an important expansion to model uncertainties, 
which is of universal significance in engineering applications. 


Remark 2.3 Using the proposed RNN based controller, the control command u(t) 
can be derived according to (2.12a), which is capable of optimizing u™u, mean- 
while, the projection operation P(e) handles inequality constraints. (2.12b) plays 
an important role in task-space tracking. By referring to (2.12c), we update the Jaco- 
bian indirectly by renewing its kinematic parameters online based on the property 
(2.3), which is different with other Jacobian adaption mathods (e.g., [48]), where 
joint acceleration is required. The necessary values of our updating law are joint 
angle 0, joint speed u and tracking error e, therefore, the proposed control strategy 
can be realized easily. 
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2.3.3 Convergence Analysis 


In this part, we conduct theoretical analysis on the convergence of tracking error 
under the RNN based tracking controller (2.12a) and (2.12b) along with the kinematic 
parameter updating law described in (2.12c). 

Firstly, two Lemmas are offered as below, which will be used in the proof process 
of convergence analysis. 


Lemma 2.1 For any closed convex set A € RP, (x — Pa(x))"(Pa(x) — y) > 0, 
Vy € A, Vx € R?, and the equality holds only if x € A [49]. 


Lemma 2.2 For any closed convex set A € RP, (x — Pa(x))"(x — y) > 0, Vy € A, 
Vx € R?, and the equality holds only if x € A [47]. 


Based on Lemma | and 2, we can obtain the following theorem about convergence 
of tracking error under the proposed redundancy resolution scheme (2.12). 


Theorem 2.1 The control error e(t) defined in (2.4) for a redundant manipulator 
globally converges to 0, provided the RNN based redundancy resolution (2.12a) and 
(2.12b), along with the kinematic adaptation law (2.12c). 


Proof: The convergence analysis includes two parts. Firstly, we will prove that 
the output u of proposed RNN (2.12a), (2.12b) would reach the optimal solution 
of (2.11). Secondly, we will show the convergence of tracking error e along with 
the adaptation law (2.12c). Note that the proof bears similarity to that with known 
parameters, but the extra dynamics on parameter adaptation makes it necessary to 
analyze the joint stability, which constructs the main difference of this proof from 
existing works. 

Part I. By defining £ = [u", 47], controller (2.12a), (2.12b) can be reformulated 
as 


e£ = —§ + Pa Œ — R(E)), (2.13) 


where 2 = {(u, A)|u € 2,4 € R”}, and R(E) = [u — JTA, —bo + Sul". Define 
VR = OR(&)/0é, we have 
_ fT 
VR= [; J iP 


J 0 


where 7 is a n-dimensional identity matrix, and VR € RO"*™™*"+” is a skew sym- 
metric matrix. The transpose matrix of V R is defined as VTR. Remarkable that VR 
satisfies the following positive semi-definite property: 


y'VRy = y'(VR+V'R)y/2 > 0, Vy e R™™. (2.14) 


This property will be used later. Define the following Lyapunov function candidate 
as 
Vi = |l — Poll /2. (2.15) 
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It is obvious that V, = 0 if and only if € 92. According to the result of reference 
in E.9, d||E — Pe (&)| 5/0 = 2(€ — Pg» (é)). Differentiating V; with respect to time 
and substituting (2.13) yield: 


Vi =(& — Pa) "E 
= — (& — Pa EDT E — PaE — R(E)))/e. (2.16) 
There is no doubt that Pa (E — R(E)) € Q, according to Lemma 2, VE € here sat- 
isfies the inequality (§ — Pa(&))"(& — Pa (E — R(é))) = 0. Then we have Vı < 0 
because € > 0, and V; = 0 only if £ € 92. Based on to LaSalle’s invariance principle 
[50], it can be proved that £ gradually converges into 22, which indicates u converges 


into §2 , the boundedness of joint angles and speed is thus guaranteed. Note that the 
equilibrium point &* satisfies 


&* = Po (&* — R(E*)). (2.17) 
According to definition 1 and Lemma 1 in [51], &* satisfies the following property 
(y —§")"RE*) 20, Vy Ee @. (2.18) 
Define function V> as 


V =(E — Po (E — R(E))) RE) + lI — E7113 /2 
— |I — Po € — RE))IB/2+ Vi. (2.19) 


Some mathematical calculations on the first and third items of the definition (2.19) 
give 
E — Pa — RE)))" RE) — ll — Pa — REDIŻ/2 
>(& — Pa — REDRE) — |E — Pa — REDI 
=(¢ — R(E) — Po (€ — RED" (Pa — RE)) — £). (2.20) 


Noticing that £ would gradually converge into the convex set 2, then we get € Q. 
According to Lemma 1, inequality (£ — Po (é — R(é)))'- (Pal — R(€)) —&) = 0 
holds for any £ — R(&) € R”*". Recalling the definition of V2, we have 


Vo > JIE — 6113/2 + Vi. (2.21) 


Thus Vz is a Lyapunov function candidate. Differentiating V2 with respect to time 
and combining (2.13) yields: 


V2 =(E — Po (E — R(E))) "VRE + ETRE) + (E — ETE 


2.3 Main Results 27 


— &— PoE — R) "E + Vi 
= — (€ — Pa(& — RE)TVRE — PaE — R(E)))/e 
(E — RE) — Po (E — RE)" (PoE — RE)) — &")/e 
— (E —&*)'RE)/e+ Vi. (2.22) 


Remarkable that £* € 2 , according to Lemma 1, inequality (£ — R(E) P(E 
R(E)))T (Pa (E — R(E)) — £*) > Oholds for any £ — R(€) € R™*". Using (2.14), we 
have —(€ — Pg (E — R(E))) 'VR(E — Ps (é — R(E)))/€ < 0 since € > 0. Accord- 
ing to mean value theorem, we have 


R) — RE*) = VREDNE — £*), (2.23) 


where &’ € [&, &*]. After some mathematical calculations on the following polyno- 
mials and substituting (2.23), we have 


É- ERE) 
=(€ — &*)"VRE')E — &*) + E — ESTRES). (2.24) 
Using the properties (2.14) and (2.18), we have (€ — &*) "'VR(é')(é — £*) > 0 and 


(E — &*)R(E*) > 0, then 
(E —&*)TR(E) > 0. (2.25) 


Combining inequalities (2.16), (2.24) and (2.25) yields V < 0, and Vz = 0 only 
if £ € Q, which indicates (£ — Po (E — R(E)))TV R(E — Po (E — R(E))) = 0, (E — 
R(E) — Pa lE — RE)" (PaE — RE)) — &*) = 0 and ( — £*)" R(E) = 0. From 
(2.24), we get (E — &*) 'VR(E')(E — &*) and (£ — &*)"R(E*) = 0. Notable that £ = 
&* is the solution of the above equations. Based on LaSalle’s invariance principle, we 
arrive at a conclusion that £ would gradually reach its equilibrium point &*, i.e., u(t) 
would converge to its optimal solution of redundancy resolution problem (2.10). 
Part II. Consider the Lyapunov function candidate 


V3 = e'e/2+4, rT 'ă/2. (2.26) 
Differentiating V3 with respect to time and substituting (2.4), (2.8) and (2.9), we have 


V3 =e'Ga= 3) + array 
=e" (ža — J (0, ap)u + (1 — 1)kle|?sgn(e) + aTr lay 
=e" (bo — Y (0, U) (Âk — Gx) — kle|’sgn(e)) — a Y; (0, we 
=e" (bọ — Ju + Y4(0, way) — klel?*! — aT YT (0, we. (2.27) 
As proved above, using the neural network (2.12), uTu will be minimized under the 


constraints bọ = Ju and u € &2. Notable that a Y A (0, u)e is a scalar value, we have 
ar yT O, u)e = e'Y; (0, u)ay. Then (2.27) can be rewritten as 
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V3 =e Y, (0, u)ay — klel? +! — al YT (0, we 
=—klel|?t! <0. (2.28) 


Then we have e = xq — x is bounded. Taking the time derivative of V3, we have: 


V3 = —k(p + Lel’sgn(eyé 
= —k(p + Dlel’sgn(e)(ta — J (0, apu). (2.29) 


Since J (0, ap) is composed of trigonometric functions of 0 and kinematic parameters 
ak, J (0, ax) is bounded. xg is also bounded. As illustrated in part I, u is bounded, 
thus Vi is guaranteed to be bounded. Using Barbalat’s lemma [52], we have V3 > 0 
as t —> oo. Then e — 0 as t —> ow. This completes the Proof. 


Remark 2.4 The convergence analysis shows the stability of the proposed control 
strategy. The tracking error would globally convergence to 0. The proof also illustrates 
that the control command u(t) is ensured u(t) € 22, Vt > 0, provided u(0) € 2, the 
boundedness of joint speed is thus guaranteed all the time. 


2.4 Illustrative Examples 


2.4.1 Numerical Setup 


We consider the position tracking problem in task space, then JACO, can thus be 
regarded as a functional redundant manipulator. The architecture of JACO, is shown 
in Fig. 2.2, and the DH parameters are shown in Table 2.1. Noticing that the last 3 
joints of JACO, do not intersect at a single point, these joints cannot be simplified 
as spherical joint, therefore the configuration of JACO is more general than other 
6DOF manipulators, e.g., the PUMA 560. The initial state of joint position vector 


Fig. 2.2 The physical = a NR Ps 
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Table 2.1 DH parameters of the Kinova JACO? robot manipulator 


Link ai (m) di (m) aj (rad) 6; (rad) 
1 
2 + 1/2 
3 63 — 1/2 
4 0 
5 0 H 
6 0 —0.2028 T 06 — 1/2 
0.2 
6, 
~ 5 Tey 
E N =; 
z0 s 
2 
o 
F 
~ 
9-0.2 
= 
-0.4 
0 1 t(s) 2 3 


2 
— 9, 
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_ 1.5 A A 
3 "3 ne] 
iy -0 g 
z 1f i = 
T bs Oo 
g 05 > 
5 £ 
E 3 
S 
-0.5 - 
0 1 2 3 0 1 (s) 2 3 


(c) (d) 


Fig. 2.3 Results of regulation control on JACO; to a fixed point [0.3, 0.4, 0.4]m in the Cartesian 
space. a Motion trajectory of end effector (red curve) and the corresponding incremental con- 
figurations of JACO2. b Error-time curve along three directions. € Angle-time curve of 6 joints. 
d Command-time curve of joint velocity u 
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Fig. 2.4 Results when 
JACO), tracks a given circle 
in Cartesian space. a Motion 
trajectory of end effector (red 
curve) and the corresponding 
incremental configurations 


0.8 


Tracking error(m) 


of JACOp. b Error-time 8 

curve along three directions. xm) 0592 o gy 04 ° s m 
c Angle-time curve of 6 (a) 

joints. d Command-time 

curve of joint velocity u. 25 
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6(0) is randomly set as [0.5, 0, 1.5, 0, 0, O]"rad, and the initial joint speed u(0) 
is selected to be zero. The nominal values of kinematic parameters are selected as 
â (0) = [0.25, 0.2, 0, —0.2, —0.1, —0.2]'m. The set 2 describing joint speed limits 
are set to be [—2, 2]°rad/s. The control gain k is set to be 8, and the gain matrix I 
is selected as 0.57, where J is a 6-dimensional identity matrix (Fig. 2.4). 
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2.4.2 Fixed-point Regulation 


In order to verify the proposed tracking strategy, a set value adjustment experiment 
is carried out. Set the desired fixed position of the end-effector to [0.3, 0.4, 0.4]'m, 
select the zoom factor to be € = 0.008. The simulation results are shown in Fig. 2.3. 
The adjustment error converges to zero, and the convergence time is about 0.5s, as 
shown in Fig. 2.3b. Therefore, the joint angle @ reaches a set of constant values, as 
shown in Fig. 2.3c. The combined velocity u(t)reaches the limit at the beginning of 
the simulation, making the end-effector move toward the target as fast as possible, 
and slowing down rapidly when the end-effector approaches the target. During the 
whole simulation process, u is guaranteed not to exceed its limit, as shown in Fig. 
2.3d. Finally, as shown in Fig. 2.3a, the robot successfully reaches the fixed point 
under the proposed control scheme. 


2.4.3. Circular Trajectory 


In this section, tracking of a smooth circular trajectory using the control scheme is 
carried out. The end effector of JACO, is expected to move at an angular speed of 
0.5rad/s along a circular trajectory. The desired circle is centered at [0.3, 0.3, 0.3]" 
with a radius of 0.1732m, and has a revolute angle of 45° around the x-axis. The 
scaling coefficient is selected as € = 0.008 The convergence time is about 0.5s, which 
is similar to the regulation case. As shown in Fig. 2.4d, when the simulation begins, 
the robot moves at the maximum speed when the tracking error is big (Fig. 2.3a), 
which makes the end effector move close to the desired circle. Then the robot moves 
at a low speed periodically, and correspondingly, the joint angle 0 changes with the 
same frequency (Fig. 2.4b), meanwhile, the tracking error is already close to zero 
(Fig. 2.4a), which means that the robot has successfully tracked the desired circular 
trajectory with time. According to (2.6), the reference speed vector of the end-effector 
bo can be derived, and its components along x—, y—, and z— directions are shown 
as blue lines in Fig. 2.4e-g, in which red lines represent the corresponding values 
of Ju. The red lines quickly converge to blue ones, demonstrating that the proposed 
control strategy is able to track the given trajectory under kinematic uncertainties. 
The joint velocity norm ||u| k is shown in Fig. 2.4h. 


2.4.4 Square Trajectory 


In this section, the JACO, is used to track a square trajectory. The corners of the 
desired square in the Cartesian space are set to be [0.3, 0.4, 0.4]7, [0.4, 0.3, 0.4]7, 
[0.3, 0.2, 0.2] and [0.2, 0.3, 0.3]T. The motion period is 12.56s. The velocity norm 
of desired path ||xa(t)|| keeps constant, which means that the expected velocity 
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Fig. 2.5 Results when 
JACO), tracks a square 
trajectory in Cartesian space. 
a Motion trajectory of end 
effector (red curve) and the 
corresponding incremental 
configurations of JACO>. b 
Error-time curve along three 
directions. c Angle-time 
curve of 6 joints. d 
Command-time curve of 
joint velocity u. e The first 
Cartesian velocity input 
bo(x-axis direction) 
described by (2.6) and the 
corresponding output Ju(l). 
f The second Cartesian 
velocity input bo (y-axis 
direction) described by (2.6) 
and the corresponding output 
Ju(2). g The third Cartesian 
velocity input bo (z-axis 
direction) described by (2.6) 
and the corresponding output J r a T 02 i 
Ju(3). h The Euclidean © l O 
norm of the manipulator’s 
joint velocity 
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of end effector xa(t) remains constant between two adjacent vertices, while xa(t) 
changes discontinuously at the four corners. The scaling coefficient is selected as 
e = 0.008. Numerical results are shown in Fig. 2.5. In the initial stage, the tracking 
error approaches zero over time after a short transient state, at the same time, the joint 
speed remains within the set 2 at all times (Fig. 2.5d). The output of the position 
controller (2.6) and the resulting responses under the proposed control scheme along 
the x—, y—, and z— directions are shown in Fig. 2.5e—g. The red lines converge to blue 
ones quickly both at the beginning of simulation and after discontinuous change of the 
desired velocity, and the joint speed also switches at these moments, as shown in Fig. 
2.5d. As a result, there exist vibration on the error curve at time t = 3.14, 6.28, 9.24, 
12.56, 15.7, 18.8 s, with the maximum value of [4 x 1073, 2 x 1073, 1.5 x 1073]"m. 
The joint velocity norm ||u| a is shown in Fig. 2.5h (Table 2.2). 
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Table 2.2 Comparisons among different tracking controllers on manipulators 


Method | Exact JMPI Initial Joint Regulation | Tracking | Real-time 
model 
Exact Position | Velocity | Error error Control 
model 

This Not No Any Limited Zero Zero Yes 

chapter | required 

Controller | Not Yes Any Unlimited | Zero Failed Yes 

in [28] required 

Controller | Not Yes Any Unlimited | Zero Zero Yes 

in [35] required 

Controller | Required | No Any Limited Zero Zero Yes 

in [39] 

Controller | Required | No Any Limited Zero Zero Yes 

in [40] 

Controller | Required | No Any Unlimited | Zero Failed No 

in [61] 

Controller | Required | No Restrictive] Limited Failed Zero Yes 

in [62] 


2.4.5 Comparison 


In this section, we compare the proposed method with the performance of the existing 
redundant robot tracking controller as shown in Table 2.2. JMPI [28, 35] and RNN 
policy-based tracking controller [39, 40, 61, 62]. In the reference [28, 35] and our 
study, the exact kinematic model of the robot is not needed, which can be used 
to solve the kinematic uncertainty problem.The controllers proposed [28, 35, 40, 
61] are calculated according to the speed level, while the controllers proposed [39, 
62] are designed according to the acceleration level.In this chapter, we develop a 
speed-level controller. The controller obtains the control command in [28, 35] by 
computing the pseudo-inverse of the jacobian. These strategies cannot be used when 
the robot is in a singular configuration. Although DLS [22] and other improved 
methods are introduced, the convergence of tracking error of singular configuration 
cannot be guaranteed, and its physical limitations are not considered. In addition 
to referencing [62], the initial position of the end-effector can be set randomly in 
the controller, which needs to be on the desired path when referencing [62]. The 
controllers kin [39, 40, 62] based on RNN can guarantee the boundedness of the 
control command. The three controllers can track the time-varying trajectory, but 
the position adjustment fails in [62]. In summary, our controller can achieve stable 
tracking under both regulation and path tracking, and it does not need the accurate 
kinematic model and pseudo-inverse calculation of the Jacobian matrix, so it has 
good flexibility and adaptability to the uncertain environment. 
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2.5 Summary 


This chapter studies the kinematic control of redundant robots with uncertain kine- 
matics. A dynamic neural network is proposed to solve the redundancy problem by 
using an adaptive motion identifier to learn motion parameters online. The interaction 
between the adaptive online identifier and the neural controller makes it a nonlinear 
coupled system. The global convergence of tracking error is verified by the Lyapunov 
theory. Numerical experiments and comparisons based on JACO, robot arm demon- 
strate the effectiveness of the algorithm and its superiority over existing algorithms. 
This method is combined with RNN to realize static and dynamic task space tracking. 
The pseudo-inverse computation of the Jacobian matrix is avoided and the real-time 
performance of the controller is guaranteed. The boundedness of joint speed can also 
protect the robot and improve safety performance. Before concluding this chapter, 
it is worth pointing out that this is the first dynamic neural model of motion con- 
trol for a manipulator with adaptive redundancy based on kinematic regression, with 
demonstrable convergence and guaranteed performance limits. 


Appendix 


According to (2.3), the analytical expression of JACO!s kinematic regressor matrix 
Y, € R**° is given as follows. 


Yi = 0, 
Y2= —Ayc1¢7 + 615152, 
Yi3 = —ĝicı, 


Yi = ĝa (c1c203 + €15253) + Ôi (€25153 — €38152) — Ô (C1C263 + C18283), 

Yis = 01 ((V3c1c4)/2 + (V3s4(s18283 + €2€351))/2 + (c28183)/2 — (c38182)/2) — 
6a (W381 54)/2+(W/3c4 (cye2¢3-+¢1 5283))/2) +62 ((e1c2¢3 + ¢15253)/2)—(W/3s4(c1¢253 — 
©1382))/2 + (c18283)/2) — 63((c1¢2€3)/2 — (W354 (C1283 — c1¢352))/2), 

Yis = (65((W3c5 (8184 + c4(c1C263 + €15283)))/2 + (W385 ((c481)/2 — (Salerer€3 
+ €15283))/2 + (V3 (c1c283 — c16382))/2))/2) — O4((38184)/4 — (W305 (5154) 
/2 + (ca(crercs + €18953))/2))/2 — (355(cas1 — s4(cic2c3 + €15283)))/2 + 
(V3c4(cie2c3 + €15253))/4) + O1((W3c1c4)/4 + (W395 (C154 — C4818253 + C263 
81)))/2 — (Wes ((crc4)/2 — (W3(c25193 — €35182))/2 + (sa(s18293 + €2¢381))/2)) 
/2 + (W3s4(s15253 + .€2€381))/4 + (c25183)/4 — (€38152)/4) + ô (+3 (c1c203 
+ €18253))/2))/2(/3es((s4 + (C1€253 — €1€352))/2 — (W3.84(1€283 — €1¢352))/4 + 
(cycacs)/4 + (c15283)/4 + (W/3cas5(c16283 — €10352))/2) — 3 ((V/ 305 ((s4(c1C283 
— €1€352))/2 + (V3 (c1c263 + c18283))/2))/2 — (W354(c10253 — €10352))/4 + 
(c1c263)/4 + (c15253)/4 + (W/3c4s5(c1¢253 — c1¢352))/2)), 


Yo, = 0, | 
Yoo = —01C182 — 020251, 
Yo3 = —451, 


Yq = 02(515283 + €2€381) — 03 (S185283 + €2€351) — 01 (C1C253 — C1038582), 
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Yos = 62((s15283)/2 + (cr¢351)/2 — (W354 (cr8153 — €35182))/2) — 63 (815253) 
/2 + (c20381)/2 — (W384 (c25183 — €35182))/2) + Êi ((V3cas1)/2 — (W384 (c1c263 + 
c18283))/2 — (c1c283)/2 + (€1€382)/2) + Êa ((v3c184)/2 — (/3.c4 (515253 + 
c2€351))/2), 

Yoo = 01 ((W3c451)/4 — (35 ((cas1)/2 — (sa(cicres + €18253))/2 + (V3 (cic2 
53 — €1€352))/2))/2 + (V355 (s184 + c4(c1c203 + €18253)))/2 — (W3s4(c102¢3 + 
€15283))/4 — (c1€283)/4 + (€1€382)/4) — 65((W/3e5 (C154 — c4(815253 + €2€351))) 
[2+ (/355((c104)/2 — (V3 (c28183 — €38152))/2 + (84818253 + €2€351))/2))/2) + 
B>((18253)/4 + (V3es((V3(s18283 + €2€381))/2 + (84(C28193 — €35182))/2))/2 + 
(c2¢351)/4 — (V354(c28153 — €35182))/4 + (W3ca85(cr8183 — €35152))/2) — 

83 ((s18253)/4 + (V3c5((V3 (818283 + €2€381))/2 + (84(C28193 — €35182))/2))/2 + 
(c2¢351)/4 — (W354(c28153 — €38152))/4 + (30455 (C25153 — €35152))/2) — 

84 ((/3e5 ((€154)/2 — (c4(818283 + €2€351))/2))/2 — (V3c184)/4 + (W355 (C164 + 
$4(815283 + €2€351))) 

/2 + (3ca(s15283 + €2€351))/4), 


Y3; = 0, | 
Y32 = —025p, 
Y33 = 0, 


Y34 = 03 (C283 — C382) — O2(c283 — C352), i 

Y35 = 63((c253)/2 — (€382)/2 + (W3s4(cacz + 8283))/2) — 82 ((c283)/2 — (€352) 
/2 + (V354 (c2c3 + s283))/2) + (V384c4 (C283 — €352))/2, 

Y36 = Ôs((V3s5((43 (C263 + s283))/2 + (s4(€283 = €352))/2))/2 — (W3ca0s 
(c283 — €382))/2) + Âa ((W3c4(c283 — €382))/4 + (35455 (C283 — €352))/2 — 
(V3c4cs (C283 — €382))/4) — O2((c283)/4 — (c382)/4 + (W/3e5((W/3 (C283 — €352)) 
/2 — (sa(cocs + 5283))/2))/2 + (W354 (Coe3 + 5293))/4 — (V3cas5 (C263 + $253) 
/2) + 03((c283)/4 — (c382)/4 + (W/3c5((W/3(c253 — €352))/2 — (s4(c263 + 5253)) 
/2))/2 + (W354 (coc + 5283))/4 — (3455 (C263 + 5283))/2). 
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Chapter 3 A) 
RNN Based Adaptive Compliance get 
Control for Robots with Model 

Uncertainties 


Abstract Position-force control is challenging for redundant manipulators, espe- 
cially for the ones considering both joint physical limitations and model uncer- 
tainties. In this chapter, we considered adaptive motion-force control of redundant 
manipulators with uncertainties of the interaction model and physical parameters. 
The whole control problem is formulated as a QP equation with a set of equality 
and inequality constraints, where based on admittance control strategy, the desired 
motion-force task is combined with the kinematic property of redundant manipula- 
tors, corresponding to an equality constraint in the formed QP equation. Moreover, 
the uncertainties of both system model and physical parameters are also considered, 
together with the complicated joint physical structure constraints, formulating as a 
set of inequality constraints. Then an adaptive recurrent neural network is designed 
to solve the QP problem online. This control scheme generalizes recurrent neural 
network based kinematic control of manipulators to that of position-force control, 
which opens a new avenue to shift position-force control of manipulators from pure 
control perspective to cross design with both convergence and optimality consider- 
ation. Numerical results on a 7-DOF manipulator LBR iiwa and comparisons with 
existing methods show the validity of the proposed control method. 


3.1 Introduction 


A manipulator is called redundant if its DOFs are greater than those required to 
complete a task. The redundant DOFs enable the robot to maintain the position and 
direction of the end actuator to complete a given task and adjust its joint configuration 
to complete a secondary task. Take advantage of this feature, typical manipulator 
systems such as collaborative robots, space robotic arms, dexterous hands [1, 2] are 
all designed as redundant ones. 

In Chaps. 1 and 2, we mainly focus on kinematic problems, in which we assume 
the end-effector of a manipulator could move freely in cartesian space. In fact, in 
industrial applications, the interaction between robot and external environment must 
be considered, for example, in tasks such as grinding, human-robot interaction, not 
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only the high-precision motion control to a given trajectory should be guaranteed, 
but also the contact force exerted by the external environment should be guaranteed. 

There are several approaches to achieve force control for robot manipulators. By 
introducing series elastic actuators as flexible units, force control can be realized by 
adjusting the compliance of joint angles. In [3], in order to overcome the discon- 
tinuous friction and complexity problem of traditional back-stepping based meth- 
ods, a modified command-filter is introduced, and then an adaptive back-stepping 
controller is designed. Experimental results show the effectiveness of the proposed 
method. Other control schemes realize force in cartesian space. The most widely 
used method is called impedance control [4], where the robot and environment are 
regarded as impedance and admittance, respectively. The interaction model (which is 
also called impedance model) can be a spring-mass-damper system, spring-damper 
system, etc. Besides, a series of hybrid position-force controllers are designed in 
[5, 6], which consist of two independent loops, namely position loop, and force 
loop. By designing control schemes separately, the final control efforts can be for- 
mulated as the sum of the output of the independent loops. Similar research can be 
found in [7-9]. 

In industrial applications, the accurate value of the impedance model can hardly 
be obtained. for example, the stiffness parameter may be sensitive to environmen- 
tal factors such as temperature, humidity, etc. Besides, the uncertainties in physical 
parameters would also affect control performance. In order to due with the uncertain- 
ties in the interaction model, in [10], an adaptive impedance controller is designed, in 
which a neural network is used to learn the nonlinear dynamics of the interaction part. 
In [11], by considering the influence of unknown dynamics of the external environ- 
ment, and a radial basis function based controller is proposed, in which an objective 
function is used to regulate the torque and an adaptive admittance technique is used 
to minimize path tracking errors. In [12], a human-like-learning based controller 
is designed for interaction with environmental uncertainties. It is proved that the 
controller is capable of handling unstable situations such as tool switching, and can 
finally achieve an expected stability margin. Besides, contact force sensors are not 
required. Using the approximation ability of artificial neural networks, some intelli- 
gent controllers are reported in [13—17]. As to physical uncertainties, in [18], a fixed 
point controller is proposed based on robust adaptive control theory, the controller 
also ensures the bounded-ness control torque. Cheng et al. propose a unit quaternion 
based controller based on neural networks [19], which shows good performance in 
eliminating singularities, and semi-global stability is proved by theoretical results. In 
[20], a Jacobian adaptation method based on zeroing dynamics is proposed, in which 
the Jacobian matrix is updated according to the information of desired and actual 
accelerations. Other feasible adaptive strategies are reported in [21—24], in which the 
Jacobian is estimated by updating physical parameters online. As to physical con- 
straints, in [42], an adaptive neural network control scheme is designed for systems 
with non-symmetric input dead-zone, as well as output constraints and model uncer- 
tainties. The output constraints are guaranteed by the barrier Lyapunov function. In 
[43], a boundary adaptive robust controller is established for flexible rise systems, in 
which an auxiliary system is introduced to suppress vibrational offset, and an estima- 


3.1 Introduction 41 


tor is constructed to observe to upper-bound of disturbances. The controller achieves 
the global convergence of control errors. Although the above-mentioned controllers 
could handle uncertainties in the interaction model or physical parameters, few stud- 
ies have considered both uncertainties at the same time. More importantly, those 
controllers rarely consider the secondary task, let alone the redundancy resolution 
problem. Besides, the boundary of joint states is ignored, which is essential in pro- 
tecting the robot. 

In order to accomplish the secondary task in the reliable physical range, a kine- 
matic control method of redundant manipulator based on QP is proposed [25-28]. 
The objective function is based on the secondary task, and the constraints describe the 
basic properties and physical constraints of the system [29]. Because of the high effi- 
ciency of parallel computing, the recurrent neural network is often used to solve the 
redundancy decomposition problem based on QP. In recent years, research shows 
that RNN based controller has good performance in motion control of redundant 
manipulator [30]. In [31], in order to achieve task space tracking, the joint velocity 
command is designed to ensure the boundary of joint angle, velocity, and acceler- 
ation. In the paper [32], by maximizing the indirectness of its time derivative, an 
operational optimization scheme is proposed. Numerical experiments show that the 
average increase in this method is 40%. In [33], different levels of redundancy res- 
olution are discussed. Recently, RNN based methods have been extended to control 
examples of flexible robots, multi-robot systems, and methods such as [34—40]. How- 
ever, as far as we know, there is no existing dynamic neural network (including RNN 
and DNN) protocol for the force control of redundant manipulators. It is necessary 
to consider not only the trajectory tracking problem of free-motion direction, but 
also the precise control of the contact force, especially for the system with model 
uncertainty. In addition, from the literature review, one of the research directions 
of a dynamic neural network is to extend the protocol of redundant manipulator of 
motion control task to those aspects that need precise control of tracking ability and 
contact force. 

Based on the above observation results, we propose the first RNN based redundant 
manipulator position force controller, which considers the uncertainty of the interac- 
tion model and physical parameters. In this paper, the ideal case of the known model 
parameters is discussed, and then an adaptive admittance control scheme based on 
RNN is established. It ensures the boundary of joint angle and velocity. The effective- 
ness of the proposed controller is verified by the theoretical derivation and numerical 
results of LBR iiwa. Before concluding this chapter, the main contributions compared 
to the existing work are as follows 


e As far as we know, this is the first time to focus on the motion-force control 
of redundant manipulators with model uncertainties based on the framework of 
RNNs, comparing to researches on kinematic control, the research is an important 
extension in the theoretical framework of dynamic neural networks. 

e Different from traditional methods, an optimization-based controller is proposed, 
while ensuring the stability of the system, physical limitations are also guaranteed. 
Which is very helpful to improve the security of the system. 
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e The controller proposed in this chapter is able to learn uncertain model param- 
eters online, which has better robustness and adaptability to uncertain working 
conditions. 

e The proposed method is pseudo-inversion-free, which can save computing costs 
effectively. 


3.2 Preliminaries 


3.2.1 Problem Formulation 


When a robot is controlled to perform a given operational task, the forward- 
kinematics of a serial manipulator is formulated as 


x(t) = f@), (3.1) 


with 0 € R” being the generalize variable of the robot, and x(t) € R” being the 
description of end-effector’s coordinate in task space. Without loss of general- 
ity, in this chapter, we assume that all joint are rotational joints. Therefore, 0 
represents the vector of joint angles. In the velocity level, the Jacobian matrix 
J(6, ak) = 3 f (0 (t), a)/d0(t) € R”*” is used to describe the relationship between 
X(t) and 6 as 

X(t) = J (OE), ax) O(0), (3.2) 


where a, € R’ is a vector of physical parameters. In terms with (3.2), an important 
property which will be used in the controller design is given as below 


JOH, a,)O(t) = Y (0, Jar, (3.3) 


with Y (0, 6) € R”*! being the kinematic regressor matrix. 

The physical parameters are very essential in describing the robot’s kinematic 
model, for example, as the most common physical parameters, the length of robot 
links affects the DH parameters directly, which are fundamental in the controller 
design. In this chapter, the physical parameters refer to the length of robot links. 

Figure 3.1 shows the interaction between the robot and environment, the contact 
force between the robot and workpiece is required to be precisely controlled. When 
the fixed contact surface is known, according to the idea of admittance control, the 
interaction model can be described as a spring-damping system as 


F = Ky(x — xa) + Ka(x — Xa), (3.4) 


where K, € R?” and K4 € R? are the corresponding damping and stiffness coef- 
ficients, xg is the desired trajectory. If K, and K4 are known, the desired contact 
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Fig. 3.1 Spring-damper K 
model of interaction 
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force Fy can be obtained by designing the reference velocity of the end-effector x, 
based on xa, Xa, x and Fy according to Eq. (3.4). 


å, = K7'F — K} 'K,p(xa — x) + ža- (3.5) 


Remark 3.1 In this chapter, we only consider the contact force on the vertical direc- 
tion of the contact surface, and friction force is ignored, therefore F is aligned with 
the normal direction of the surface. When the surface is priorly known, by defining 
a rotational matrix R between the tool coordinate system and based coordinate sys- 
tem, K „ and K4 can be formulated as K, = diag(0, 0, kp)R, Kp = diag(O, 0, ka) R, 
respectively. Then K, and K, can be described as single parameters. 


In practical applications, the real values of system parameters such as ag, K p and 
Ka are usually unavailable. In terms of ag, due to machining and installation error, 
the length of robot's links may be different from the nominal value, and the robot 
may hold uncertain tools, which would lead to uncertainties in ag. As to K, and 
Kı, the real values are even more difficult to obtain. Kz and K, are related to the 
material and structure of the workpiece, furthermore, those parameters would change 
in different environmental conditions. Therefore, it is a challenging issue to achieve 
precise force control in the presence of parameter uncertainties. 

For a redundant manipulator, the redundant DOFs can enhance the flexibility of 
the robot, and this property can be used to achieve a secondary task. In industrial 
applications, by minimizing the norm of joint speed, the kinematic energy can be 
optimized. Therefore, in this chapter, the objective function is selected as 


H(6) = Tô. (3.6) 


In order to save energy consumption in the control process, a smaller value of H (0) 
is preferred. 


Remark 3.2 The objective function H (Ê) is a typical function to describe the sec- 
ondary task in redundant resolution problems, as reported in [21, 25]. In actual imple- 
mentations, this function can be defined according to the designer’s preferences or 
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actual requirements. In this chapter, we propose a generalized RNN based force con- 
trol strategy with simultaneous optimization ability. Based on the proposed control 
strategy, similar controllers can be easily designed by defining different objective 
functions. 


3.2.2 Control Objective and QP Problem Formulation 


Before pointing out the control objective, it is noteworthy that the robot must satisfy 
certain constraints. For example, due to the physical structure, every joint angle 6; 
must not exceed its limitations i.e., the lower bound 6, and upper bound 6; . Fur- 
thermore, limited by actual performance of actuators, joint speed 6 is also restricted, 
ie, 6- <6 < 6+. 

When the actual parameters of interaction model are unknown, the control objec- 
tive of this chapter is to design a force controller with adaptation ability, i.e., to 
realize accurate force control along the predefined contact surface, in the sense that 
F — Fy, at the same time, physical constraints of joint angles and velocities must be 
ensured. According to (3.2), (3.5) and (3.6), the control objective can be described 
in the view of optimization as 


min H(@) = 676, (3.7a) 
st. x, = J (0,apÊ, (3.7b) 
å, = Kz' Fa — Kz'Kp(x — xa) + ža, (3.7c) 
0 <6 <6t, (3.7d) 
6- <6<6". (3.7e) 


Remark 3.3 So far, we have arrived at a generalized description of admittance con- 
trol for redundant manipulators in the QP problem. Apparently, there exist parameter 
uncertainties in J(@, ax), kp and kg as formulated in (3.7b) and (3.7c). In the next 
chapter, we will solve the problem (3.7) with the aid of RNNs. 


3.3 Main Results 


In this chapter, an recurrent neural network based adaptive admittance controller 
is proposed to solve (3.7). An ideal situation where real values of system model 
are perfectly known is firstly considered, which lays the foundation of the later 
discussion. Then an adaptive RNN is proposed to achieve force control in the presence 
of model uncertainties. We also show the stability of the control method. 
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3.3.1 Nominal Design 


In order to explain the proposed adaptive control scheme more clearly, an ideal case 
in which all parameters are perfectly known is firstly discussed. It can be regarded as 
a special case of uncertain parameter ones. In this case, both K4 and K, are available, 
then the real value of x, is available according to (3.5). 

Let w = 6 and define a Lagrange function as Lı = oTo +AT (Jæ — FıK7' + 
Kp K7' (x — xa) — ža), with A being the Lagrange multiplier. Similar to [25], a RNN 
with provable convergence can be designed as 


eò = —w + Po(—JTA), (3.8a) 
eh = —K7' Fi + K7'Kp(x — xa) — ža, (3.8b) 


where € is a positive constant and P,,(e) is a projection operator to set 2 as 
P..(x) = argminy.g||y — x||, and the set 2 = {w € R” ai; < @; < Obax} is a con- 
vex set describing the modified speed constraints based on escape velocity method 
[29], and Onin = max{a@ (Omin = 0), Omin} Umax = min{a (Omax = 0), Omax}, a>0. 
The stability of system can be readily proved, which is similar in [25], is omitted 
here. 


3.3.2 Adaptive Control Method Based on RNN 


Based on the previous description, in this subchapter, by learning the uncertain param- 
eters online, an adaptive RNN is established to solve the force control problem with 
gravity torque optimization under model uncertainties, the stability of the system is 
also proved. 


3.3.2.1 Adaptive RNN Design 


In order to handle the uncertain interaction parameters K, and Kg, let K p and Ra 
be their estimates. Although K, and K4 are unknown, they are considered to be 
constant. Then the estimated reference velocity %, can be derived by replacing Kp, 
Kı with K p and Ka respectively according to (3.5) 


å, = R7! Fa — Ky Kye — xa) + ža. (3.9) 


Let n = [x — xa, år — ža]!, W = [Kp, Ka]" and Ŵ = [Kp Éa]. Then we can 
rewrite (3.9) as Fy = W'n. However, due to the uncertainties in Kg and K p» in the 


actual process, the resulting contact force F using x, directly is F = W'n, it is 
noteworthy that the contact force F can be measured by force/torque sensors. 
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As to the uncertain az, the alternative Jacobian matrix J (6, âx) is used by substi- 
tuting a, with its estimate âz, then J (6, ax) in equality constraint (3.7b) is replaced 
by J (0, â). Therefore, the force control problem with joint speed optimization con- 
sidering model uncertainties can be formulated as 


min H(w) = o'o, (3.10a) 
st. J(0,dy)@ = R7' Fa — K7'K p(x — xa) + ża, (3.10b) 
0T <0 <0*, (3.10c) 
Oo < w< w”. (3.10d) 


To solve (3.10), by defining a Lagrange function as L = œw + A(J (0, &)@ — 
X,), the adaptive RNN is designed as 


eò = —w + Pe(—J"A), (3.1 1a) 
eh = J (0, aw — år, (3.11b) 
Ŵ = -T,n(Fa — F)’, (3.11c) 
ay = -DYT(J (0, âo — `), (3.11d) 


where £, I and I» are positive gains. Figure 3.2 shows the framework of the proposed 
adaptive RNN in real-time force control with uncertain parameters. In order to learn 
the uncertain parameters, the neurons W and â; update their values based on desired 
signals xq, Xq and Fy and the feedback of x, x and F. The output of the RNN is 
exactly the joint speed command w. By designing proper updating laws, A and w 
achieve both stability of the inner loop and the optimization of H (œ). 


Remark 3.4 In this chapter, we consider the case where m = 6, n = 7 (where m is 
the dimension of the cartesian space, and n is the number of joint angles). Since only 
the contact force on the vertical direction of the surface is considered, the dimension 
of K4 and Kp are all 1(the contact surface if known). As illustrated in Fig.3.2, the 
proposed adaptive RNN has a typical one-layer architecture, and the total number of 
neurons isn +/+m+2. 


Remark 3.5 The proposed adaptive RNN (3.11) can be regarded as a generalized 
form of the nominal RNN (3.8), when W = W and â, = 0, it can be obtained that 


W =0and a = 0 from (3.3) and (3.9). Then (3.11) is the same as (3.8). However, 
it is remarkable that adaptive RNN is capable of dealing with model uncertainties. 
On the other hand, unlike the adaptive RNNs based kinematic control strategies in 
[21, 22], the proposed controller can achieve both precise control of both position 
and contact force. 
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Fig. 3.2 A schematic framework of the proposed RNN based force controller 


3.3.2.2 Stability Analysis 


So far, a theorem about the convergence of the force control problem using proposed 
adaptive RNN in presence of model uncertainties can be summarised as below 


Theorem 3.1 Consider the force control problem for a category of redundant manip- 
ulators described in (3.1)—(3.4) with model uncertainties, the state variable w of the 
proposed adaptive RNN will converge the optimal solution of (3.7), i.e., the force 
control error will converge to 0, and the norm of joint speed will be optimized simul- 
taneously. 


Proof The proof consists of three steps. Firstly, we will prove that W and âx could 
learn the model parameters online, and then the stability in the inner-loop is also 
analyzed. 

Step 1. Define the estimate error of concatenated form of W as W = W — W, and 
er = F — Fy be the error between the contact force and the desired signal. From 
(3.9), ep can be formulated as ep = W'n — W'n = W'. Consider the Lyapunov 
function as V; = tr(W'W) /2, which tr(e) is the trace of a matrix. Calculating the 
time derivative of V; yields 


Vi = tr(W'W) = tr(-l\W'n(Fa — F)") 
= tr(—Iyeses) = -Neper <0. (3.12) 


From (3.12) and (3.10) and using LaSalle’s invariance principle [41], we have ere {= 


0, as t — oo. In other words, the state variable W ensures the convergence of force 
error e p by modifying the end-effector’s reference speed Â, according to (3.5). 
Step 2. Define the estimate error of ag as 4, = âg — ag, and let Vz = apa, /2. It is 
notable that during the control process, a, can be regarded as constant, then we have 
aie = ay. Actually, using the property described in Eq. (3.3), based on linearization 
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Algorithm 3 The proposed adaptive RNN based force controller 
Input: Nominal values of interaction model K}, K} and physical parameters aj. Physical range 
of joint angles and joint velocities 0} ax» Olin 6! ax Ôi in The desired trajectory xa, žą and the 
desired contact force Fy. Positive control gains a, l1, I>, €. Sensor readings of contact force F and 
movement of the end-effector x, xy. Task duration T. 
Output: To achieve position-force control in presence of model uncetainties 

1: Initialize 4(0), a (0) <— ax, WO) < [K”; K"] 

2: Repeat 

4 x, Xx, 6, 6, F < Sensor readings 

Obtain the Jacobian matrix J (6, ay) and kinematic regressor matrix Y (0, 6) 
Calculate the modified reference trajectory i, by Equation (3.9) 
Update interaction parameters W by Equation (3.1 1c) 
Update physical parameters a; by Equation (3.1 1d) 
Update state variable à by Equation (3.11b) 

9: Update joint velocity command w by Equation (3.1 1a) 
Until (t > T) 


0) Sa. ON Pe > 


descriptions of Ê and ag, respectively, the task-space velocity x has two equivalent 
descriptions, namely J (0, an) and Y (0, ĝar. As a result, the estimated value 3 also 
has two similar descriptions, depending on the estimated value of kinematic parame- 
ter âx. Therefore, the updating law Eq. (3.11) is equivalent to — M YT (Y (0, œw)âk — x). 
Then it follows from (3.2) and (3.3) that 


A 


âk = -D Y"(J (0, a) — J (0, ayo) 


= — DY (0, w)"Y (0, w)ax. (3.13) 


In light of (3.13), V can be rewritten as 


V = àl âk 


= -D aălY (0, œ)"Y (0, w)ay < 0. (3.14) 


Then it can be readily obtained that Y (0, w)a, — 0 as t —> oo. From (3.3) and and 
definition of a, J (0, a,)@ will eventually converge to J(0, a;)a, i.e., the equality 
constraint (3.10b) will eventually be equivalent to (3.7b). 

Step 3. Then we will prove the stability of inner-loop system. According to (3.11), 
the dynamics of w and à can be reformulated as 


e = —§ + Polé — FE)I, (3.15) 


with € = [w", AT)", @ = {(o, Vlo € 2, à € R”}, and 


P- w+ JTA 
~ | -J@ — Fik; + K,)Kj' (x — xa) — xa} 


Define VF = 0F(&)/0&, we have 
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with J being the identity matrix. Then it can be readily obtained that VF + (V F)" is 
positive semi-definite. According to the definition in [32], F is a monotone function of 
€. From the description of (3.11) and (3.15), Pa canbe formulated as Pg = [P2; Pr], 
where Pr € R” isa projection operator of A to set R, with the upper and lower bounds 
being +oo. Therefore, Pg is a projection operator to closed set 2. Based on Lemma 
1 in [32], the adaptive RNN (3.11) is stable, and the will be ultimately equivalent to 
the solution of (3.7). This completes the Proof. 


Remark 3.6 Till now, we have shown the stability of the proposed RNN based 
adaptive admittance control strategy in the presence of uncertain model parameters. 
The established adaptive RNN is capable of maintaining the boundedness of system 
states and avoiding calculating the pseudo-inversion of the Jacobian matrix. 


3.4 Illustrative Examples 


In this chapter, numerical results on a 7-DOF robot manipulator LBR iiwa are carried 
out. The physical structure and D-H parameters of iiwa are shown in Fig.3.3. All 
we all know, up to 6 DOFs (3 DOFs of position and another 3 DOFs of orientation) 
are required to fulfill a given task in engineering applications, therefore, the iiwa 
is a typical redundant manipulator in the force control when considering both the 
position and orientation of the end-effector. As to the contact force, the contact surface 
is selected as a plane in the workspace, as shown in Fig.3.3a. The end-effector is 
controlled to offer a desired contact force on the contact surface while tracking a 
given path on it. In the control process, the orientation of the end-effector is wished 
to keep constant. 

This chapter mainly consists of three parts, firstly, a comparative simulation 
between the proposed controller and pseudo-inverse of Jacobian matrix(PJMI) based 
method is firstly discussed, and then the effectiveness of the proposed adaptive con- 
troller is checked via more cases. In addition, more discussion about the superiority 
of the proposed method is carried out to enlighten the contribution of this chapter. 


3.4.1 Simulation Setup 


In this chapter, the initial value of joint angles is set as 6) = [0, 2/3, 0, 7/3, 0, 
1/3, 0]'rad, and the corresponding coordinate of the end-effector is noted as PO. The 
initial value of joint velocity is set as A = [0, 0, 0, 0, 0, 0, 0] "rad/s. The contact sur- 
face is defined as a horizontal plane with z = 0.094m, and the physical parameters of 
the interaction model are set as K, = 5000, Kg = 20, respectively. The limitations of 
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4 0 rm/2 0 

5 0 -7/2 0.40 
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7 0 0 0.276 
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Fig. 3.3 The architecture of 7-DOF manipulator iiwa. a Physical structure. b Table of D-H param- 
eters 


joint angles and velocities are selected as 07 = [—2, —2, —2, —2, —2, —2, 2)" rad, 
A = [2, 2, 2, 2, 2, 2, 2) rad, 0 = [—2, —2, —2, —2, —2, —2, —2]Trad/s and Öt = 
[2, 2, 2, 2, 2, 2, 2]Trad/s, respectively. The control gains of the proposed ARNN are 
set as € = 0.002, I, = diag(5000, 3000), I = 1007, a = 8, respectively. 


3.4.2 Comparative Simulation Between PJMI Methods 


Firstly, a comparative simulation between the proposed control strategy and tradi- 
tional Jacobian-inverse based methods is carried out to show the superiority of the 
RNN based controller. The robot is expected to provide a contact force of 20N at a 
fixed point P1 = [0.2, 0.6, 0.094]'m, without considering the orientation control of 
the end-effector. In traditional PJMI based methods, the joint commands are obtained 
by directly calculating the inverse of the Jacobian matrix online, and only the spe- 
cial solution is considered. Simulation results are shown in Fig. 3.4. Both controllers 
can guarantee the convergence of positional and force errors. Using the same con- 
trol gain in the outer loop, although the controller based on PJMI achieves a faster 
convergence of control errors, its output is big at the beginning of the simulation 
(with the Euclidean norm of joint velocity being about 20 rad/s), moreover, as shown 
in Fig. 3.4c, the joint angle 6s exceeds its upper bound during 0.2-1 s. In contrast, 
using the RNN based controller, both joint angles and velocities are ensured not to 
exceed their limits. It is worth noting that at about t = 0.6s, 0s reaches its upper 
limit (Fig. 3.4e), correspondingly, the joints move a relatively big range, as shown 
in Fig. 3.4f, as a result, 0s stops increasing and then converges to a group of certain 
values via self-motion. 
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Fig. 3.4 Numerical results of comparative simulation between the proposed scheme and PJMI 
based methods. a Profile of position and force errors. b Euclidean norm of joint velocities. ¢ Profile 
of joint angles using PJMI method. d Profile of joint velocities using PJMI method. e Profile of 
joint angles using the proposed method. f Profile of joint velocities using the proposed method 


3.4.3 Force Control Along Predefined Trajectories with 
Model Uncertainties 


In this subchapter, we will carry out a group of experimental tests to further ver- 
ify the validity of RNN based admittance controller (3.11). In terms of the inter- 
action parameters, we assume the nominal values of K, and Kg is 4500 and 15, 
respectively. As to the kinematic parameters, the nominal value of a, is set to be 
dx(0) = [D1 (0), D3(0), Ds(0), D7(0)]|" = (0.36, 0.4, 0.42, 0.25]'m. 
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(1) Force Control On Fixed Points 

Similar to Sect.3.4.2, a motion-force control at fixed points is studied first. When 
simulation begins, the target point is set as P1 = [0.2, —0.6, 0.094]'m, at t = 5s, 
the target point is reset to P2 = [0.2, —0.4, 0.094]'m. During the simulation, the 
contact force between the end-effector and contact surface is selected as Fy = 20N. 
Numerical results are shown in Figs. 3.5 and 3.6. 

The position error when the simulation begins is about 0.2 m, accordingly, the 
proposed RNN based controller generates a large output, which ensures the quick 
convergence of both motion and force errors. The stabilization time is about 0.5 s. 
Att =5s, the target point is switched to P2, leading to an instantaneous change of 
position error. Using the adaptive admittance controller (3.11), the robot adjusts its 
joint configurations quickly and then slows down with the convergence of errors. It 
is remarkable that the second joint reaches its maximum value, and during the whole 
process, the joint velocities are guaranteed not to exceed the predefined limits. The 
estimated values of R, and Kq are shown in Fig. 3.5f, although the exact values of 
Kg and K, are unknown, by updating R, and K p online according to (3.11), precise 
control af control is achieved. The differétios between the task-space speed x and 
its estimate value Ya, is shown in Fig.3.5g, correspondingly, Di. Do, D; and Da 
converge to a group of constant value. 


(2) Force Control Along A Circular Path 

In this example, the end-effector is controlled to offer constant contact force 
Fa = 20N while tracking a circular trajectory on the contact surface, this trajec- 
tory is defined as xy = [—0.1 + 0.1cos(0.5t), —0.6 — 0.1sin(0.5t), 0.094]'m, and 
the orientation is required to remain the same as the initial state. Numerical results 
are shown in Figs.3.7 and 3.8. As shown in Fig.3.7a, the robot tracks the desired 
path successfully, and both position and orientation errors converge to zero in less 
than | s, the expected contact force is also obtained. Because of the periodicity of 
the desired commands, the robot's joint angles and angular velocities change peri- 
odically, at the same time, boundedness of 6 and Å is also guaranteed. On the other 
hand, the smooth change of @ and 6 shows that the proposed controller is very stable. 
Based on the adaptive strategy (3.11d), the system shows great robustness against 
uncertain system parameters. 


(3) Force Control Along A Rhodonea Path 

In this example, we consider the case where the robot provides a time-varying contact 
force while tracking a Rhodonea path. The desired contact force is set to be Fy = 
20 + 5sin(0.2t)N, and the Rhodonea path is defined as 


Xax(t) = 0.1sin(0.4t)cos(0.2t), 
xay (t) = 0.15sin(0.4t)sin(O0.2t) — 0.6, 
Xaz(t) = 0.094. 
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Fig. 3.5 Numerical results of force control at fixed points with uncertain model parameters. a 
Profile of positional error. b Profile of orientational error. ¢ Profile of contact force. d Profile of 
joint angles. e Profile of joint speed. f Profile of the estimated interaction coefficients. g Profile of 
lY âk — xl. h Profile of the estimated physical parameters 
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Fig. 3.6 Snapshots when iiwa offers a constant contact force at fixed points. a Snapshot when 
t = 2s. b Snapshot when t = 7s 


Numerical results are shown in Figs. 3.9 and 3.10. Figure 3.9a, b show the positional 
and orientational errors the end-effector respect to the desired path, respectively. In 
the steady-state, accurate motion control is realized using the proposed controller, and 
the operation force between the end-effector and contact surface is shown in Fig. 3.9c. 
At the beginning stage, the joint speed is high, which enables the end-effector to move 
toward the desired path rapidly. As the end-effector approached the expected path, 
the robot moves at a low speed periodically and smoothly, correspondingly, the joint 
angles changes at the same frequency. The Euclidean norm of Y âp — x is illustrated in 
Fig. 3.9g, the proposed RNN based control strategy could calculate control command 
@ online with subject to model uncertainties. The estimated model parameters are 
given in Fig. 3.10f, h. 


3.4.4 Comparison 


To further illustrate the contribution of the proposed force control strategy, we provide 
a comparison between the proposed method and the existing related methods, as 
shown in the Table3.1. In [11], an adaptive admittance control scheme based on 
neural network approximation capability is proposed and the admittance adaptive 
tracking error is optimized. However, no physical constraints are considered. In 
[10], although the established impedance controller can guarantee input saturation, 
the controller still needs to calculate the pseudo-inverse of the jacobian. In [25, 32], 
a pseudo-inverse-free controller based on RNN is designed to realize the task space 
tracking of redundant robots, and its convergence is proved. Convex optimization 
and non-convex optimization are also obtained. Considering physical uncertainties, 
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Fig. 3.7 Numerical results of force control along a circular curve with uncertain model parameters. 
a Profile of positional error. b Profile of orientational error. c Profile of contact force. d Profile of 
joint angles. e Profile of joint speed. f Profile of the estimated interaction coefficients. g Profile of 
the estimated physical parameters. h Profile of the objective function ||| 5 
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Fig. 3.8 Snapshots when iiwa offers a constant contact force along a circular curve. a Snapshot 
when t = 8s. b Snapshot when ¢ = 15s 


Table 3.1 Comparisons among different tracking controllers on manipulators 


Method Handling Pseudo- Force Handling Convergence | Secondary 
model inverse tracking inequality task 
uncertainties | Required versus constraints 

trajectory 
tracking 

This chapter | Yes No Force Yes Yes Yes 
Yes Yes Force Restrictive? | Yes Yes 

[10] 

Yes Yes Force No Yes No 

[11] 

Restrictive? | No Kinematic Yes Yes Yes 

[21, 22] 

No No Kinematic Yes Yes Yes 

[25, 32] 


In [10], only the input saturation is considered 
b Tn [21, 22], the authors only consider the uncertainties of physical parameters, while the contact 
force is ignored 


two different adaptive strategies are proposed in [21, 22]. This is the first RNN-based 
force controller in this chapter. On the other hand, this control scheme is suitable for 
the case of model uncertainty, so it is no longer necessary to calculate the pseudo- 
inverse of the Jacobian matrix and has great application potential in force control. 


3.5 Summary 


In this chapter, we propose an adaptive admittance control method for redundant 
robots based on a recursive neural network. The convergence of the adaptive RNN 
is proved by the theoretical derivation of the Lyapunov technique, and the effective- 
ness of the control strategy is verified by numerical simulation on the 7-DOF robot 
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Fig. 3.9 Numerical results of force control along a Rhodonea curve with uncertain model parame- 
ters. a Profile of positional error. b Profile of orientational error. c Profile of contact force. d Profile 
of joint angles. e Profile of joint speed. f Profile of the estimated interaction coefficients. g Profile 
of || Yâ — x| 5. h Profile of the estimated physical parameters 
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Fig. 3.10 Snapshots when iiwa offers a constant contact force along a Rhodonea curve. a Snapshot 
when t = 4s. b Snapshot when ¢t = 19s. c Snapshot when t = 27s 


iiwa. Compared with the existing control methods, the controller not only has better 
performance in dealing with physical constraints but also has better performance in 
eliminating pseudo-inversion calculation. Finally, it is worth noting that this is the 
first time that an RNN-based approach has been extended to force control for the 
redundant manipulator, especially for model uncertainty manipulators. The research 
of this subject is of great significance to grinding robot, assembly robot and industrial 
application. 
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Chapter 4 ®) 
Deep RNN Based Obstacle Avoidance sic 
Control for Redundant Manipulators 


Abstract In this chapter, we consider the obstacle avoidance problem of redundant 
robot manipulators with physical constraints compliance, where static and dynamic 
obstacles are investigated. Both the robot and obstacles are abstracted as two crit- 
ical point sets, respectively, relying on the general class-K functions, the obstacle 
avoidance problem is formulated into an inequality in speed level. The minimal- 
velocity-norm (MVN) is regarded as the cost function, converting the kinematic 
control problem of redundant manipulators considering obstacle avoidance into a 
constraint-quadratic-programming problem, in which the joint angles and joint veloc- 
ity constraints are built in velocity level in form of inequality. To solve it, a novel 
deep recurrent neural network based controller is proposed. Theoretical analyses and 
the corresponding simulative experiments are given successively, showing that the 
proposed neural controller does not only avoid collision with obstacles, but also track 
the desired trajectory correctly. 


4.1 Introduction 


With development of intelligent manufacturing and automation, the research on 
robot manipulators is obtaining increasing attention from a large number of scholars, 
numerous fruits have been reported on painting, welding and assembly [1, 2] and 
so on. With the popularization of robots, higher requirements such as flexibility and 
execution ability are imposed on robots, especially working in the complicated envi- 
ronment [3]. Consequently, more and more scholars cast light on redundant robots 
which show better flexibility, responsiveness [4, 5]. 

Stem from the consideration of human-machine collaboration, robots are no longer 
arranged in a separate area [6-8], which makes the obstacle avoidance for robots 
become an important part of kinematic control of the robot manipulators. There 
has reported many obstacle avoidance methods applicable to robot manipulators. A 
modified RRT based method, namely Smoothly RRT, was proposed in [9]. This paper 
established a maximum curvature constraint to obtain a smooth curve when avoiding 
obstacles. Compared to the traditional RRT based method, the proposed method 
achieves faster convergence. In [10], Hsu investigated the probabilistic foundations 
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of PRM based methods, obtaining a conclusion that the visibility properties has a 
heavier impact on the probability, and the convergence would be faster if extract 
partial knowledge could be introduced. However, due to the heavy computational 
burdens, those methods can be hardly used online. 

Apart from stochastic sampling based algorithms mentioned above, the artificial 
potential field method is also a potential method for obstacle avoidance, and have been 
found their application in [11—15]. Taking advantage of redundant DOFs, obstacles 
can be avoided by the self-motion in the null space. Using pseudo-inverse of Jacobian 
matrix, the solution can be built as the sum of a minimum-norm particular solution 
and homogeneous solutions [16-18]. 

With parallelism and easier to implement in hardware, neural networks have been 
a powerful tool in robot control. Artificial intelligence algorithms based on neural 
networks provide a new view for robotic control, these methods are very promising 
due to neural networks’ excellent learning ability [19]. For example, in [20], a neural 
network based learning scheme was proposed to handle functional uncertainties. 
In [21], a bio-mimetic hybrid controller was designed, where the control strategy 
consist of an RBF neural network based feed-forward predictive machine and a 
feedback servo machine based on a proportional-derivative controller. In [22], a fuzzy 
logic controller is proposed for long-term navigation of quad-rotor UAV systems 
with input uncertainties. Experiment results show that the controller can achieve 
better control performance when compared to their singleton counterparts. In [23], 
an online learning mechanism is built for visual tracking systems. The controller 
uses both positive and negative sample importance as input, and it is shown that the 
proposed weighted multiple instance learning scheme achieves wonderful tracking 
performance in challenging environments. The system model of robot manipulators is 
highly nonlinear, however, if the prior information of the model is known in advance, 
the neural network can be optimized. This is to say, on one hand, the number of nodes 
in neural networks can be reduced. In addition, the excellent learning efficiency 
is maintained simultaneously [24]. Therefore, to achieve the real-time control of 
robot manipulators, a series of dynamic neural network are proposed, such as [25- 
27]. For kinematic control of redundant manipulators, such a time-varying problem 
will be transformed into a quadratic programming from perspective of optimization, 
where nonlinear mapping from joint space to cartesian space is abstracted as a linear 
equation. Dynamic neural networks can be used to solve the quadratic-programming 
problem online, therefore, the kinematic control of manipulators is achieved when 
the formulated linear equation is ensured. More importantly, these methods can 
also handle inequality constraints considering joint physical constraints, and model 
uncertainties [28-32]. There are few works on obstacle avoidance using dynamic 
neural network. In [33], the obstacle avoidance problem is considered as an equality 
constraint, however the parameters of the escape velocity is not easy to get. In [34], 
the distance between the robot and obstacles are formulated as a group of distances 
from critical points and robot links. On this basis, an improved method is proposed 
by Guo et. al. in [35], which can suppress undesirable discontinuity in the original 
solutions. 
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Motivated by the above observations, in this chapter, a RNN-based obstacle avoid- 
ance strategy was proposed for redundant robot manipulators. Both the robot and 
obstacles are abstracted as two critical point sets, respectively, relying on the class-K 
functions, the obstacle avoidance problem is formulated into an inequality in speed 
level. The minimal velocity-norm (MVN) is regarded as the cost function, converting 
the kinematic control problem of redundant manipulators considering obstacle avoid- 
ance into a constraint-quadratic-programming problem, in which the joint angles and 
joint velocity constraints are built in velocity level in form of inequality. To solve it, a 
novel deep recurrent neural network based controller is proposed. Theoretical anal- 
yses and the corresponding simulative experiments are given successively, showing 
that the proposed neural controller does not only avoid collision with obstacles, but 
also track the desired trajectory correctly. The main contributions of this chapter are 
summarized as below: 


e A deep RNN-based controller is proposed. Four objectives are achieved at the 
same time, i.e, the desired path tracking, obstacle avoidance, physical constraints 
compliance considering joint angles and velocity constraints and optimality of 
cost functions. 

e Relying on a class-K function, a novel obstacle avoidance strategy is given, where 
robots and obstacles are abstracted into a set of critical point sets, the distance 
between them can be described as the point-to-point distance. 

e Numerical results show that the effectiveness of the designed RNN controller, i.e, 
the static and dynamic obstacles can be avoided while tracking the desired motion 
trajectory. 


4.2 Problem Formulation 


4.2.1 Basic Description 


When a redundant robot is controlled to track a particular trajectory in the cartesian 
space, the positional description of the end-effector can be formulated as 


x= f@), (4.1) 


where x € R” and 6 € R” are the end-effector’s positional vector and joint angles, 
respectively. In the velocity level, the kinematic mapping between x and 6 can be 
described as 

x = J(0)6, (4.2) 


where J(@) € R”*” is the Jacobian matrix from the end-effector to joint space. 

In engineering applications, obstacles are inevitable in the workspace of a robot 
manipulator. For example, robot manipulators usually work in a limited workspace 
restricted by fences, which are used to isolate robots from humans or other robots. 
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Fig. 4.1 The basic idea of 
obstacle avoidance in this 
chapter 


This problem could be even more acute in tasks which require collaboration of 
multiple robots. Let Cı be the set of all the points on a robot body, and C2 be the 
set of all the points on the obstacles, then the purpose of obstacle avoidance of a 
robot manipulator is to ensure Cı U C2 = Ø at all times. By introducing d as a safety 
distance between the robot and obstacles, the obstacle avoidance is reformulated as 


|O;Ai| = d, VA; € C), VO; € Co. (4.3) 


where |O;A;| = Ji — O;)'(A; — Oj) is the Euclidean norm of the vector A; O;. 

Equation (4.3) gives a basic description of obstacle avoidance problem in form 
of inequalities. However, there are too many elements in sets Cı and C2, the vast 
majority of which are actually unnecessary. Therefore, by uniformly selecting points 
of representative significance from C, and C3, and increasing d properly, Eq. (4.3) 
can be approximately described as below 


|O;Ai| = d, (4.4) 
with Aj,i=1,...,a and Oj, j =1,...,b being the representative points of the 


robot and obstacles, respectively. The schematic diagram of Eq. (4.4) in shown in 
Fig. 4.1. 


4.2.2 Reformulation of Inequality in Speed Level 


In order to guarantee the inequality (4.4), by defining D = |O; A;| — d, an inequality 
is rebuilt in speed level as 


d(|Oj;Aj|)/dt > —sgn(D)g(|D)), (4.5) 


in which g(e) belongs to class-K. Remarkable that the velocities of critical points 
A; can be described by joint velocities 


4.2 Problem Formulation 67 
Aj = Ja (0È, (4.6) 


where Ja; € R”*” is the Jacobian matrix from the critical point A; to joint space. If 
Oj; is prior known, the left-side of Eq. (4.5) can be reformulated as 


d d 
= OjAil) =E (Ai = "Ai = 07) 


1 ; i 
=o a] ^ - 0)" (å; — Ò;) 
J L 
—>r . —r: 
=|O;jAi| Jai(9)0 — |O;Ai| Oj, (4.7) 


where |O; A;| = (A; — O;)'/|O;A;| € R'*” is the unit vector of A; — O;. There- 
fore, the collision between critical point A; and object O; can be obtained by ensuring 
the following inequality 


% —> r; 
Joi < sgn(D)g(|D|) — |OjAil| Oj, (4.8) 
where J,; = —|O;A;|" Jai € R'*”. Based on the inequality description (4.8), the 


collision between robot and obstacle can be avoided by ensuring 


JÊ < B, (4.9) 

where J, = eae tee, Ji tee, i tee, Joy e R” is the concatenated form 
b b 

of J,; considering all pairs between A; and Oj, B=[By,---, Biy,--: , Bai, 


- , Bay|' € R® is the vector of upper-bounds, in which B;; = sgn(D)g(|D|) — 
—— n 
|O; Ail" O}. 
Remark 4.1 According to Eq. 4.5 and the definition of class-K functions, the origi- 
nal escape velocity based obstacle avoidance methods in [34, 35] can be regarded as 
a special case of 4.5, in which G(| D|) is selected as G(| D|) = k| D|. Therefore, in 


this chapter, the proposed obstacle avoidance strategy is more general than traditional 
methods. 


4.2.3 QP Type Problem Description 


As to redundant manipulators, in order to take full advantage of the redundant DOFs, 
the robot can be designed to fulfill a secondary task when tracking a desired trajec- 
tory. In this chapter, the secondary task is set to minimize joint velocity while avoid- 
ing obstacles. In real implementations, both joint angles and velocities are limited 
because of physical limitations such as mechanical constraints and actuator satura- 
tion. Because of the fact that rank(J) < n, there might be infinity solutions to achieve 
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kinematic control. In this chapter, we aim to design a kinematic controller which is 
capable of avoiding obstacles while tracking a pre-defined trajectory in the Cartesian 
space. For safety, the robot is wished to move at a low speed, in addition, lower 
energy consumption is guaranteed. By defining an objective function scaling joint 
velocity as 6"6/2, the tracking control of a redundant manipulator while avoiding 
obstacles can be described as 


min 6176/2, (4.10a) 
S.t. X= Xd, (4.10b) 
0T <6 <6", (4.10c) 
6- <6 <6t, (4.10d) 
JÊ < B. (4.10e) 


It is remarkable that the constraints Eq. (4.10b)—(4.10e) and the objective function 
(4.10a) to be optimized are built in different levels, which is very difficult to solve 
directly. Therefore, we will transform the original QP problem (4.10) in the velocity 
level. In order to realize precise tracking control to the desired trajectory xq, by 
introducing a negative feedback in the outer-loop, the equality constraint can be 
ensured by letting the end-effector moves at a velocity of x = xa + k(xa — x). In 
terms with (4.10c), according to the escape velocity method, it can be obtained by 
limiting joint speed as (07 — 0) < Ê < a(6+ — 0), where « is a positive constant. 
Combing the kinematic property (4.2), the reformulated QP problem is 


min 6'6/2, (4.11a) 
s.t. J(9)0 = ža + k(xa — x), (4.11b) 
max(a(6~ — 0), ÖT) < 6 < min(6t, a(0* — 0)), (4.110) 
J, < B. (4.11d) 


It is noteworthy that both the formula (4.11a) and (4.11d) are nonlinear. The 
QP problem cannot be solved directly by traditional methods. Using the parallel 
computing and learning ability, a deep RNN will be established later. 


4.3 Deep RNN Based Solver Design 


In this chapter, a deep RNN is proposed to solve the obstacle avoidance problem 
(4.11) online. To ensure the constraints (4.1 1b), (4.1 1c), and (4.11d), a group of state 
variables are introduced in the deep RNN. The stability is also proved by Lyapunov 
theory. 
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4.3.1 Deep RNN Design 


Firstly, by defining a group of state variables 4, € R”, Az € R, a Lagrange function 
is selected as 


L = 6076/2 + AT ha + k(xa — x) — J(0)6) +45 (Jo6 — B), (4.12) 
A; and Az are the dual variables corresponding to the constraints (4.1 1b) and (4.1 14d). 


According to Karush-Kuhn-Tucker conditions, the optimal solution of the optimiza- 
tion problem (4.12) can be equivalently formulated as 


6=P (6 — 24) (4.13a) 
TAMY age 
J(0)6 = ia +k(xa — x), (4.13b) 


eae if J,0 =B, (4.130) 


à2=0 if J< B, 


where Po(x) = argmin,.¢||y — x|| is a projection operator to a restricted interval 
2, which is defined as 2 = {6 € R"|max(a(@~ — 0), ÒT) < 6 < min(6t, a(t — 
@))}. Notable that Equation (4.13c) can be simply described as 


do = (Ao + Jĝ — BY, (4.14) 


where (e)* is a projection operation to the non-negative space, in the sense that 
yt = max(y, 0). 

Although the solution of (4.13) is exact the optimal solution of the constrained- 
optimization problem (4.11), it is still a challenging issue to solve (4.13) online since 
its inherent nonlinearity. In this chapter, in order to solve (4.13), a deep recurrent 
neural network is designed as 


68 = —6 + Po (JT — JIM), (4.15a) 
eh, = ža + k(xa — x) — J (È, (4.15b) 
Be =—le+ Go + JÔ — BY, (4.15c) 


with € is a positive constant scaling the convergence of (4.15). 


Remark 4.2 As to the established deep RNN (4.15), the first dynamic equation is 
also the output dynamics, which combines the dynamics of state variables à; and 
Az, as well as the physical limitations including joint angles and velocities. State 
variable à; is used to ensure the equality constraint (4.11b), as shown in (4.15b), A, 
is updated according to the difference between reference speed xg + k(xa — x) and 
actually speed J (@)6. Similarly, Az is used to ensure the inequality constraint (4.114), 
which will be further discussed later. It is remarkable that £ plays an important role in 
the convergence of the deep RNN. The smaller £, the faster the deep RNN converges. 
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Remark 4.3 By introducing the model information such as J, Jo, etc., the estab- 
lished deep RNN consists of three class of nodes, namely 6, A, and A», and the total 
number of nodes is n + m + ab. Comparing to traditional neural networks in [19], 
the complexity of neural networks is greatly reduced. 


4.3.2 Stability Analysis 


In this part, we offer stability analysis of the obstacle avoidance method based on 
deep RNN based. First of all, some basic Lemmas are given as below. 


Definition 4.1 A continuously differentiable function F (e) is said to be monotone, 
if VF + VF’ is positive semi-definite, where VF is the gradient of F (e). 


Lemma 4.1 A dynamic neural network is said to converge to the equilibrium point 
if it satisfies 
KX = —xX + Ps(x — pF(x)), (4.16) 


where k > 0 and p > 0 are constant parameters, and Ps = argmin,.s||y — x|| is a 
projection operator to closed set S. 


Lemma 4.2 [37] Let V : [0,00) x By > R be a C! function, ot, œz be class-K 
functions defined on [0, d) which satisfy œi (||x||) < V(t, x) < a2(||x||), Y(t, x) € 
[0, d) x Ba, then x = O is a uniformly asymptotically stable equilibrium point if there 
exist some class-K function g defined on [0, d), to make the following inequality hold 


ae + Nee x) < —a(||x||), V(t, x) € [0, œ) x Ba, (4.17) 
ot Ox 


About the stability of the closed-loop system, we offer the following theorem. 


Theorem 4.1 Given the obstacle avoidance problem for a redundant manipulator 
in kinematic control tasks, the proposed deep recurrent neural network is stable and 
will globally converge to the optimal solution of (4.10). 


Proof The stability analysis consists of two parts: firstly, we will show that the 
equilibrium of the deep RNN is exactly the optimal solution of the control objective 
described in (4.11), which prove that the output of deep RNN will realize obstacle 
avoidance while tracking a given trajectory, and then we will prove that the deep 
recurrent neural network is stable in sense of Lyapunov. 

Part I. As to the deep recurrent neural network (4.15), let (6*, Ay, A5) be the 
equilibrium of the deep RNN, then (6*, A*, 43) satisfies 


—6* + Po(JTA* — F704) =0, (4.18a) 
ža + k(xa — x) — J(0)6* =0, (4.18b) 
—AS + (AS + J,6* — B)t =0, (4.18c) 
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with 6* be the output of deep RNN. By comparing Equation (4.18) and (4.13), we 
can readily obtain that they are identical, i.e., the equilibrium point satisfies the KKT 
condition of (4.10). 

Then we will show that the equilibrium point (output of the proposed deep RNN) 
is capable of dealing with kinematic tracking as well as obstacle avoidance problem. 
Define a Lyapunov function V about the tracking error e = xg — x as V = e'e/2, 
by differentiating V with respect to time and combining (4.11b), we have 


V = elè = e" (ża — J(0)6") 
= —ke"e <0, (4.19) 
in which the dynamic equation (4.18b) is also used. It can readily obtained that the 


tracking error would eventually converge to zero. 
It is notable that the dynamic equation (4.18c) satisfies 


M+ J,6* — B— (5 + J,6" — B)* = J,6* — B. (4.20) 


According to the property of projection operator (e)*, y — (y)* < 0 holds for any 
y, then we have J,0* — B < 0, together with (4.5), the inequality (4.5) is satisfied. 
Notable that (4.5) can be rewritten as 


D > —sgn(D)g(|D)). (4.21) 


As to (4.21), we first consider the situation when equality holds. Since g(|D|) 
is a function belonging to class K, it can be easily obtained that D = 0 is the only 
equilibrium of D= —sgn(D)g(|D|). Define a Lyapunov function as V2(t, D) = 
D?/2, and select two functions as a(|D|) = œ2(|D|) = D?/2. It is obvious that 
œı (| D|) = æ2(| D|) belongs to class-K, and the following inequality will always hold 


a1 (|D|) < Va(t, D) < a2(|D)). (4.22) 
Taking the time derivative of V(t, D), we have 


dV, OV. 
D 
ot aD 


= —|D\g(IDI) < 0. (4.23) 


According to Lemma 4.2, the equilibrium x = 0 is uniformly asymptotically sta- 
ble. Then we arrive at the conclusion that if the equality d(|O; A;|)/dt = —sgn(D)g 
(|D|) holds, |D| = 0 will be guaranteed, i.e., |O; A;| — dforalli = 1---a,=1---b. 
Based on comparison principle, we can readily obtain that |O;A;| > d when 
a(10;A;l)/dt > —sgn(D)g(|D)). 

Part II. Then we will show the stability of the deep RNN (4.15). Let E = 
[67, AT, AT]" be the a concatenated vector of state variables of the proposed deep 
RNN, then (4.15) can be rewritten as 
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e =f + Polé — FO), (4.24) 


where Ps(e) is a projection operator to a set S, and F (£) = [Fi (£), Fa (€), F3(€)]" € 
Rrtm+ab in which 


Fi Ô — JT + JIM 
Fy = JO — xa —k(xq — x) 
F; —J,6* — B 


Let VF = 0F/0€, we have 


I -st JI 
VF~é)=| J 0 oj. (4.25) 
-JZ 0 0 


According to the definition of monotone function, we can readily obtain that F (&) is 
monotone. From the description of (4.24), the projection operator Ps can be formu- 
lated as Ps = [Pg; Pr; Pa], in which Po is defined in (4.13), Pr can be regarded 
as a projection operator of à; to R, with the upper and lower bounds being +ooọ, 
and P, = (e)* is a special projection operator to closed set RY? . Therefore, Ps is a 
projection operator to closed set [2; R”; R4]. Based on Lemma 4.1, the proposed 
neural network (4.15) is stable and will globally converge to the optimal solution of 
(4.10). The proof is completed. 


4.4 Numerical Results 


In this chapter, the proposed deep RNN based controller is applied on a planar 4- 
DOF robot. Firstly, a basic case where the obstacle is described as a single point 
is discussed, and then the controller is expanded to multiple obstacles and dynamic 
ones. Comparisons with existing methods are also listed to indicate the superiority 
of the deep RNN based scheme. 


4.4.1 Simulation Setup 


The physical structure of the 4-link planar robot to be simulated is shown in 
Fig. 4.2, in which the critical points of the robot are also marked. As shown in 
Fig. 4.2, critical points A2, A4, A6 are selected at the joint centers, and A1, A3, As, 
A7 are selected at the center of robot links. It is notable that A; and the Jacobian 
matrix J,; are essential in the proposed control scheme. Based on the above descrip- 
tion of A;, the D-H parameters of A, is aj = 0.15, a2 = a3 = 0, &1 = a2 = a3 = 0, 
dı = d = d; = 0, then both the position and Jacobian matrix J,; of A; can be calcu- 
lated readily. Based on the definition in Eq. (4.8), J,; can be obtained. A; and J,; can 
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Fig. 4.2 The planar robot to 
be simulated in this chapter 


be calculated similarly. The control parameters are set as £ = 0.001, a = 8, k = 8. 
As to the physical constraints, the limits of joint angles and velocities are selected 
as 0; = —3rad, o7 = 3rad, 6 = —lrad/s, ĝt = lrad/s fori = 1...4. The safety 
distance d is set to be 0.1 m. 


4.4.2 Single Obstacle Avoidance 


In this simulation, the obstacle is assumed to be centered at [—0.1, 0.2]"m, the 
desired path is set as xg = [0.4 + 0.1cos(0.5t), 0.4 + 0.1sin(0.5t)]"m, and the ini- 
tial joint angles are set to be 0) = [7 /2, —7 /3, —7/4, O]'rad. The class-K function 
is selected as G(|D|) = Kı|D| with Kı = 200. In order to show the effectiveness 
of the proposed obstacle avoidance method, contrast simulations with and without 
inequality constraint (4.10e) are conducted. Simulation results are shown in Fig. 4.3. 
When ignoring the obstacle, the end-effector trajectories and the corresponding incre- 
mental configurations are shown in Fig. 4.3a, although the robot achieves task space 
tracking to xg, obviously the first link of the robot would collide with the obstacle. 
After introducing obstacle avoidance scheme, the robot moves other joints rather than 
the first joint, and then avoids the obstacle effectively (Fig.4.3b). Simultaneously, 
the tracking errors when tracking the given circle are shown in Fig. 4.3c. From the 
initial state, the end-effector moves towards the circle quickly and smoothly, after 
that, the tracking error in stable state keeps less than 1 x 1074m, showing that the 
robot could achieve kinematic control as well as obstacle avoidance tasks. To show 
more details of the proposed deep RNN based method, some important process data 
is given. As the obstacle is close to the first joint, critical points A; and A, are more 
likely to collide with the obstacle, therefore, as a result, the distances between the 
obstacle O; and Aj, Az are shown in Fig.4.3d, from tf = 2s to t = 6.5s, ||A;O1]| 
remains at the minimum value d = 0.1, that is to say, using the proposed obstacle 
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Fig. 4.3 Numerical results of single obstacle avoidance. a is the motion trajectories when ignor- 
ing obstacle avoidance scheme, b is the motion trajectories when considering obstacle avoidance 
scheme, c is the profile of tracking errors when considering obstacle avoidance scheme, d is the 
profile of distances between critical points and obstacle, e is the profile of joint velocities, f is the 
profile of joint angles 


avoidance method, the robot maintains minimum distance from the obstacle. The 
profile of joint velocities are shown in Fig. 4.3e, at the beginning of simulation, the 
robot moves at maximum speed, which leads to the fast convergence of tracking 
errors. The curve of joint angles change over time is shown in Fig. 4.3f. 


4.4.3 Discussion on Class-K Functions 


In this part, we will discuss the influence of different class-K functions in the avoid- 
ance scheme (4.5). Four functions are selected as Gi (|D) = K|D|?, G2(|D|) = 
K|D|,G3(|D|) = Ktanh(5|D|), G4(|D|) = Ktanh(10| D|), Fig. 4.4a shows the com- 
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Fig. 4.4 Discussions on different obstacle avoidance functions. a is the comparative curves of 
different obstacle avoidance functions. b is the profile of minimum distance of the robot and obstacle 
using different obstacle avoidance functions 


parative curves the these functions. Other simulation settings are the same as the 
previous one. Simulation results are shown in Fig.4.4b. When selecting the same 
positive gain K, the minimum distance is about 0.08 m, which shows the robot can 
avoid colliding with the obstacle using the avoidance scheme (4.5). The close-up 
graph of the tracking error is also shown, it is remarkable that the minimum distance 
deceases, as the gradient of the class-K function increases near zero. Therefore, one 
conclusion can be drawn that the function can be more similar with sign function, to 
achieve better obstacle avoidance. 


4.4.4 Multiple Obstacles Avoidance 


In this part, we consider the case where there are two obstacles in the workspace. The 
obstacles are set at [0.1, 0.25]'m and [0, 0.4]"m, respectively. Simulation results are 
shown in Fig.4.5. The desired path is defined as xa = [0.45 + 0.1cos(0.5t), 0.4 + 
0.1sin(0.5t)]". The initial joint angle of the robot is selected as 6 = [1.5, —1 — 
1, 0]". To further show the effectiveness of the proposed obstacle avoidance strategy 
4.5, g|D| is selected as g|D| = K,/(1 + e7?!) — K,/2 with Kı = 200. When A, 
is set to 0, as shown in Fig.4.5a, the inequality constraint (4.1 1d) will not work, 
in other words, only kinematic tracking problem is considered rather than obstacle 
avoidance, in this case, the robot would collide with the obstacles. After introducing 
online training of 2, the simulation results are given in Fig.4.5b—-h. The tracking 
errors are shown in Fig. 4.5c, with the transient time being about 4s, and steady state 
error less than 1 x 107*m. Correspondingly, the robot moves fast in the transient 
stage, ensuring the quick convergence of the tracking errors. It is remarkable that 
the distances between the critical points and obstacle points are kept larger than 
0.1m at all times, showing the effectiveness of the proposed method. At t = 14s, 


76 4 Deep RNN Based Obstacle Avoidance Control for Redundant Manipulators 


0.75 0.75 

Bos) Gee q 

oo Š 0.25 ° 
-0.25 


| —||A20,||| 
$ ---- || 401l] 
|| 401l] 

| llA202l]| 
—j— ||A3Oo||} 
|| 402l] 


Distances (m) 
© 
N 


Tracking error(m) 


10 15 20 0 5 10 15 20 


~~~ Ag(1) 
—— A2(2) 
~~ A2(3) 
— A2(4) 
A2(5) | 
A2(6) 


State variables 


Joint velocities(rad/s) 


t(s) 


~3 

g2 3 
gi E 
h 0 z 
g > 
Saj 2 
g 3 
= 2 + 
s - 

0 5 10 15 20 
t(s) 


(h) 


Fig. 4.5 Numerical results of multiple obstacle avoidance. a is the motion trajectories when ignor- 
ing obstacle avoidance scheme. b is the motion trajectories when considering obstacle avoidance 
scheme. c is the profile of tracking errors when considering obstacle avoidance scheme. d is the 
profile of distances between critical points and obstacles. e is the profile of joint velocities. f is the 
profile of A2. g is the profile of joint angles. h is the profile of 41 
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from Fig. 4.5d and g, when the distance between A3 and O; is close to 0.1m, the 
corresponding dual variable 42 becomes positive, making the inequality constraint 
(4.1 1d) hold, and the boundary between the robot and obstacle is thus guaranteed. 
After t = 18s, ||A30,|| becomes greater, then A2 converges to aero. Notable that 
although à; and Az do not converge to certain values, the dynamic change of A; and 
Az ensures the regulation of the proposed deep RNN. 


4.4.5 Enveloping Shape Obstacles 


In this part, we consider obstacles of general significance. Suppose that there is a 
rectangular obstacle in the workspace, with the vertices being [0, 0.5], [0.4, 0.5], 
[0.4, 0.6] and [0.5, 0.6]', respectively. By selecting the safety distance d = 0.1m, 
and obstacle points as O; = [0.05, 0.55], O2 = [0.15, 0.55]", O3 = [0.25, 0.55]? 
and O4 = [0.35, 0.55]. It can be readily obtained that the rectangular obstacle is 
totally within the envelope defined by O; and d. The incremental configurations 
when tracking the path while avoiding the obstacle are shown in Fig. 4.6b, in which 
a local amplification diagram is also given, showing that the critical points A3 is 
capable of avoiding O2 and O3. It is worth noting that by selecting proper point 
group and safety distance, the obstacle can be described by the envelope shape 
effectively. Figure 4.6c, h also give important process data of the system under the 
proposed controller, including tracking errors, joint angles, angular velocities, and 
state variables of deep RNNs. We can observe that the physical constraints as well 
as kinematic control task are realized using the controller. 


4.4.6 Comparisons 


To illustrate the priority of the proposed scheme, a group of comparisons are carried 
out. As shown in Table 4.1, all the controllers in [12, 16, 34, 35] achieve the avoidance 
of obstacles. Comparing to APF method in [12, 16] and JP based method in [12, 16], 
the proposed controller can realize a secondary task, at the same time, we present a 
more general formulation of the obstacle avoidance strategy, which is helpful to gaina 
deeper understanding of the mechanism for avoidance of obstacles. Moreover, in this 
chapter, both dynamic trajectories and obstacles are considered. The comparisons 
above also highlight the main contributions of this paper. 
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Fig. 4.6 Numerical results of enveloping shape obstacles. a is the motion trajectories when ignor- 
ing obstacle avoidance scheme. b is the motion trajectories when considering obstacle avoidance 
scheme. c is the profile of tracking errors when considering obstacle avoidance scheme. d is the 
profile of distances between critical points and obstacles. e is the profile of joint velocities. f is the 
profile of joint angles. g is the profile of 4. h is the profile of A, 
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Table 4.1 Comparisons among different obstacle avoidance controllers on manipulators 


Method Convergence | Secondary Physical Dynamic Obstacle 
task constraints obstacles avoidance 
satisfied description 
This paper Yes Considered Yes Considered Inequalities 
Yes Considered Yes * Inequalities** 
[35] 
Yes Considered Yes $ Inequalities** 
[34] 
Yes Not No Considered Repulsion 
[12] considered 
Yes Not No = Null space 
[16] considered 


* In [34, 35] and [16], dynamic obstacles are not considered 
** Regular escape velocity method is used, which is only a special case of 4.5 


4.5 Summary 


In this chapter, a novel obstacle avoidance strategy is proposed based on a deep 
recurrent neural network. The robots and obstacles are presented by sets of criti- 
cal points, then the distance between the robot and obstacle can be approximately 
described as point-to-points distances. By understanding the nature escape veloc- 
ity methods, a more general description of obstacle avoidance strategy is proposed. 
Using minimum-velocity-norm (MVN) scheme, the obstacle avoidance together with 
path tracking problem is formulated as a QP problem, in which physical limits are 
also considered. By introducing model information, a deep RNN with simple struc- 
ture is established to solve the QP problem online. Simulation results show that the 
proposed method can realize the avoidance of static and dynamic obstacles. 
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Chapter 5 A) 
Optimization-Based Compliant Control geit 
for Manipulators Under Dynamic 

Obstacle Constraints 


Abstract The research on force control among manipulators has attracted more 
and more attention from a large of scholars and researcher. In this chapter, from 
perspective of optimization, we investigated the collision-free compliance control 
of redundant robot manipulators using recurrent neural network. The position-force 
control is constructed as an equality constraint in velocity level together with the kine- 
matic property of robots. Both the joint angles and joint speed limitations on robots 
physical structure are also considered, and are described by a group of inequality con- 
straints. To avoid collision between robots and obstacles, they are described as two 
set of points, the Euclidean norm of distance between robots and obstacles, greater 
than zero, is established as the condition of collision-free occurrence. Minimizing 
joint velocities as the secondary task, a time-varying QP-type problem description 
is given with equality and inequality constraints, then an RNN-based controller is 
designed to solve it. Based on theoretical analysis and simulative experiments, the 
effectiveness of the designed controller is validated. 


5.1 Introduction 


With development of industry society, robot manipulators are required to be more 
flexible and intelligent, to satisfy the increasing personalized and customized pro- 
duction requirement [1]. Compared to non-redundant manipulators, redundant ones 
show more flexibility due to its more DOFs that exceed the required number to accom- 
plish the given task [2]. On the other hand, position control scheme on robots would 
show lower performance for some complicated tasks [3]. For example, the control 
methods that only considers position usually ignore the contact-force between robot 
and workpieces, with high safety challenge, resulting from the excessive system 
stiffness would bring the unpredictable responses [4]. Therefore, control of contact 
force between redundant robots and workpieces should be considered. 

In the light of different robot structure and control signals, until now, a number 
of methods have been proposed. Imitating the muscle-tendon tissue of animal joints, 
compliance units such as series elastic actuators (SEA), variable stiffness actuators 
and so on, are introduced into the robots. In [5], Pan et al. proposed a compliance 
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controller for SEA-based systems, achieving torque output control. As to the inter- 
action between the robot and workpieces, Hogan proposes a basic idea of impedance 
control, in which the robot and environment usually bear as an impedance and admit- 
tance, respectively [6]. Generally speaking, the contact force and relative movement 
of the robot and workpieces can be described as a combination of mass-spring- 
damper systems. Therefore, the contact force can be controlled by designing motion 
commands indirectly. Another representative approach is hybrid position-force con- 
trol, the controller is usually designed in the torque loop of the joint space, in which 
both contact forces and movement of the robot are modelled based on dynamic anal- 
ysis. Then the controller can be described as a combination of control efforts which 
achieve position and force control respectively [7]. Similar research can be found in 
literature such as [8—13]. 

During the operation, the robot may collide with the environment because the 
manipulator usually needs to keep in touch with the workpiece. In addition, the 
working space of the robot is limited [14]. For example, in a production line with 
multiple manipulators, each robot is in a fixed position. In order to avoid interference, 
the working space of the robot is limited by hardware (fences, obstacles, [emph], 
etc.) or software constraints (pre planned space). In the case of human-computer 
cooperation, the robot shall not collide with people. Therefore, it is very important to 
avoid obstacles in the process of operation. In current reports, the desired trajectory 
is usually obtained by offline programming, which is limited by the efficiency of 
programming. In order to achieve real-time obstacle avoidance control, the artificial 
potential field method has been widely used. The basic idea is that when an obstacle 
repels the robot, the target acts as an attractive pole, and then the robot will be con- 
trolled to converge to the target without colliding with the obstacle [15]. In [16], a 
modified method is proposed, which describes the obstacles by different geometri- 
cal forms, both theoretical conduction and experimental tests validate the proposed 
method. Considering the local minimum problem that may caused by multi-link 
structures, in [17], a two minima is introduced to construct potential field, such that 
a dual attraction between links enables faster maneuvers comparing with traditional 
methods. Other improvements to artificial potential field method can be found in [18, 
19]. A series of pseudo-inverse methods are constructed for redundant manipulators 
in [20], in which the control efforts consists of a minimum-norm particular solution 
and homogeneous solutions, and the collision can be avoided by calculating a escape 
velocity as homogeneous solutions. By understanding the limited workspace, the 
obstacle avoidance can be described in forms of inequalities, which opens a new 
way in realtime collision avoidance. In [21], the robot is regarded as the sum of sev- 
eral links, and the distances between the robot and obstacle is obtained by calculating 
distances between points and links. Then Guo [22] improves the method by modify- 
ing obstacle avoidance MVN scheme, and simulation results show that the modified 
control strategy can suppress the discontinuity of angular velocities effectively. 

To solve the problem of robot compliance control, the controller should be 
designed according to the required command and system characteristics. That is 
to say, robots must follow constraints to achieve compliance control, while ensuring 
unequal constraints to avoid obstacles. Obviously, the control problem involves sev- 
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eral constraints, including equal constraints and unequal constraints. By using the 
idea of constraint optimization, the control problem with multiple constraints can 
be well dealt with. In recent years, the application of recurrent neural network in 
robot control has been studied extensively, and it shows the great efficiency of real- 
time processing [23-27]. In those literatures, analysis in dual space and a convex 
projection are introduced to handle inequality constraints. 

Recently, taking advantage of parallel computing, neural networks are used to 
solve the constraint-optimization, and have shown great efficiency in real-time pro- 
cessing. In [28, 29], controllers are established in joint velocity/acceleration level, to 
fulfill kinematic tracking problem for robot manipulators. In [30], tracking problem 
with model uncertainties is considered, and an adaptive RNN based controller is 
proposed for a 6DOF robot Jacoz. Discussions on multiple robot systems, parallel 
manipulators, time-delay systems using RNN can be found in [30-33]. 

Based on the above observations, a RNN based collision-free compliance control 
strategy is proposed for redundant manipulators. The remainder of this chapter is 
organized as follows. In Chap. 2, the control objective including the position-force 
control as well as collision avoidance is pointed out, and then rewritten as a QP 
problem. In Chap. 3, the RNN based controller is proposed, and the stability of the 
system is also analyzed. A number of numerical experiments on a 4-DOF redundant 
manipulator including model uncertainties and narrow workspace are carried out in 
Chap. 4, to further verify effectiveness of the proposed control strategy. Chapter 5 
concludes the chapter. The contributions of this chapter are summarized as below 


e To the best of the author’s knowledge, there is few research on compliance control 
using recurrent neural networks, the study in this chapter is of great significance 
in enriching the theoretical frame of RNN. 

e The proposed controller is capable of handling compliance control, as well as 
avoiding obstacles in realtime, which does make sense in industrial applications, 
besides, physical constraints are also guaranteed. 

e Comparing to traditional neural-network-based controllers used in robotics, not 
only control errors but model information is considered, therefore, the proposed 
RNN has a simple structure, and the global convergence can be ensured. 


5.2 Problem Formulation 


5.2.1 Robot Kinematics and Impedance Control 


Without loss of generality, we consider series robot manipulators with redundant 
DOFs, and the joints are assumed as rotational joints. Let 0 € R” be the vector of 
joint angles, the description of the end-effector in the cartesian space is 


x= f(6), (5.1) 
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where x € R” is the coordination of the end-effector. In the velocity level, the forward 
kinematic model can be formulated as 


x = J(0)6, (5.2) 


in which J (0) = 0x/06 is Jacobian matrix. As to redundant manipulators, J € R”*”, 
rank(J) <n. 

In industrial applications, position control based operation mode has many lim- 
itations: due to the lack of compliance, pure kinematic control methods may cause 
unexpected consequences, especially when the robot is in contact with the envi- 
ronment. To enhance the compliance and achieve precise control of contact force, 
according to impedance control technology, the interaction between robot and envi- 
ronment can be described as a damper-spring system [34]. 


F = K, Ax + Kqd(Ax)/dt, (5.3) 


where K, and Ky are interaction coefficients, and Ax = x — xq is the difference 
between the actual response x and desired trajectory xg. By referring to Eqs. (5.2) 
and (5.3), we have 

t= Kj'F — K,Kj'Ax + Xa. (5.4) 


When the real values of K, and K4 are known, F can be obtained by adjusting 
the velocity x of the end-effector according to Eq. (5.4). 


5.2.2 Obstacle Avoidance Scheme 


In the process of robot force control, there is a risk of collision as the robot may 
contact with workpieces. Besides, robot manipulators usually work in a limited 
workspace restricted by fences, which are used to isolated robots from humans or 
other robots. This problem could be even more acute in tasks which require col- 
laboration of multiple robots. Therefore, obstacle avoidance problem must be taken 
into consideration. When collision does not happens, the distance between robot 
and obstacles keeps positive. By describing the robot and obstacles as two separated 
sets, namely S4 = {Aj,..., Aa}, Sg = {Bi,..., Bp}, where A;, i = 1,--- ,a and 
B;, j =1,--- , b are points on the robot and obstacles, respectively. Then the suffi- 
cient and necessary condition of obstacle avoidance problem is that the intersection of 
A and B is an empty set. That is to say, for any point pair A; on the robot and B; on the 
obstacle, the distance between A; and B; is always positive, i.e., || A; B;| |2 > 0, for 
alli = 1,...,a, j =1,--- ,b, where || e Il is the Euclidean norm of vector A; B;. 
For sake of safety, let d > 0 be a proper value describing the minimum distance 
between robot and obstacles, the collision can be avoided b ensuring || A; B;| |2 >d. 
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Remark 5.1 In fact, both S4 and Sz consist of infinite points. However, by evenly 
selecting representative points from the robot link and obstacles, S4 and Sg can be 
simplified significantly. Besides, the safety distance d can be appropriately increased. 
Despite that this treatment will sacrifice some workspace of the robot (the inequality 
||A; Bj | p > d would consider some areas that collisions do not happen, due to a big- 
ger d is considered), this sacrifice is meaningful: the number of inequality constraints 
can be reduced greatly, which is helpful for constraint description and solution. 


In real applications, the key points of the robot manipulator is easy to select. 
Cylindrical envelopes are usually used to describe the robotic links, then the key 
points can be selected on the axes of the cylinders uniformly, and the distance between 
those points can be defined the same as the radius of the cylinder. As to the obstacles 
with irregular shapes, the key points can be selected based on image processing 
techniques, such as edge detection, corrosion, etc. 


5.2.3 Problem Reformulation in QP Type 


From the above description, the purpose of this chapter is to build a collision-free 
force controller for redundant manipulators, to achieve precise force control along a 
predefined trajectory, in the sense that F > Fy, x —> xq, and || A; B Ale > d for all 
i=1,---,a,j=1,...,b. 

As to aredundant manipulator, there exist redundant DOFs, which can be used to 
enhance the flexibility of the robot. When the robot gets close to the obstacles, the 
robot must avoid the obstacle without affecting the contact force and tracking errors. 
In addition, when there is no risk of collision, the robot may work in an economic way, 
by minimizing the joint velocities, energy consumption can be reduced effectively. 
Therefore, by defining an objective function as O13, the control objective can be 
summarized as 


min |l], (5.5a) 
S.t. X = Xq, (5.5b) 
F = F;, (5.50) 
ArBill5 = d, (5.5d) 


where ÒI is the Euclidean norm of 6. It is noteworthy that in actual industrial 
applications, the robot is also limited by its own physical structures. For instance, 
the joint angles are limited in a fixed range, and the upper/lower bounds of joint 
velocities are also constrained due to actuator saturation. By combing Eq. (5.4), the 
control objective rewrites as 
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min [|l], (5.6a) 
s.t. JÖ = K}'F — K,KĮ' Ax + ża, (5.6b) 
IIA; B;ll3 >d, (5.6c) 
0T <0 <6", (5.6d) 
6 <0<6", (5.6e) 


with 67, 0+, 6-, 6+ being the upper/lower bounds of joint angles and velocities, 
respectively. However, the optimization problem is described in different levels, i.e., 
joint speed level or joint angle level, which remains challenging to solve Eq. (5.6) 
directly. Therefore, we will rewrite this formula in velocity level. As to the key points 
A; on the robot, let x4; be the coordination of A; in the cartesian space, both x4; and 
Xj are available: 


Xai = fail), (5.7a) 


Xai = Jai, (5.7b) 


where f,4;(e) is the forward kinematics of point A;, and J4; is the corresponding 
Jacobian matrix from A; to joint space. Let us consider the following equality 


d 2 2 
gy A Billa) = RCIA: Billa = 4), (5.8) 


in which k is a positive constant. It is obviously that the equilibrium point of Eq. (5.8) 
d 
is || A: Bjl|5 = d. By letting q Ar Billa) > 0, the inequality (5.5d) can be readily 


guaranteed. Taking the time-derivative of || A; B;| | yields 


d d 
ZUBAID =F A- BTA; = B) 


1 FO 
———,, (A; — B;) (Ai — Bj) 


(By Aal3 
——>r : ——> 7 
=|B; Ail Jai(0)0 — |B;Ai| Bj, (5.9) 
Ae) CAs YT INOI2 i i ; ; È. j 
where |B; A;| = (A; — B;) /||0||5 is a unit vector from B; to A;, and B; is the 


velocity of key point B; on the obstacles. By Eqs. (5.9) and (5.6c), the inequality 
description of obstacle avoidance strategy is 


Tp A'T 5 2 ip ail aE 
|By Ail Jai(0)0 = kA; Bj|l5 — d) + |B;Ai| Bj, (5.10) 
Remark 5.2 In this part, we have shown the basic idea of obstacle avoidance scheme 


in velocity level, whose equilibrium point is described in Eq. (5.8). It is notable that 
the right-hand side of Eq. (5.8) is only acommon form to realize obstacle avoidance. 
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Generally speaking, the right-hand side of Eq. (5.8) may have different forms, such as 
k(\|Ai BjlI5 — d), k(||AiBj||3 — d)’, etc. From Eq. (5.10), the value of the response 
velocity to avoid obstacles is related to the two parts, the first part is the difference 
between the actual and safety distance, the other part depends on the movement of 
the obstacles. 


In terms of the physical constraints of joint angles, according to escape veloc- 
ity method, inequalities (5.6d) and (5.6e) can be uniformly described as max 
(a(@~ — 8), 87) < 6 < min(6t, w(6* — 0)). So far, the position-force control prob- 
lem together with obstacle avoidance strategy in velocity level is as below 


min ||4||5, (5.11a) 
s.t. JÈ = K] F —KyKjz' Ax + ia, (5.11b) 
max(a (07 — 6),6~) < 6 < min(6t, a(0* — 0)), (5.110) 
Jo0 < B. (5.11d) 


where (5.11c) is a rewritten inequality considering (5.6d) and (5.6e) based on escape 
locity sch Jo = [[B1A1|" Jats +; Bo Atl Jap. + [B1Aal J1; 3 [BpAal J 
velocity scheme , Jo = [|B, Aq] Jai: t; |BpAil Jap. +++» |B Adal Jags: ++ 3 |BpAal Jab] € 
pS ee ee ee, 


b b 
Rx” is the concatenated form of Ja; considering all pairs between A; and B ji 
B = [Bi1, -+> , Bib, ++- , Bal, +++ , Bap] € R® is the vector of upper-bounds, in 
— $ 

which —k(|| A;B; ll —d)- |B; A;|'B;. From the definition of J,, B, inequal- 
ity (5.11d) in equivalent to |B, A;|"J41(0)0 > k(\|A1Bil|5 — d) + |BiA1|" Bi... 
|ByAal’ Jaa(@)0 > k(||Aa Boll} — d) + |BpAa|" By, which is the cascading form of 
the inequality description (5.10) for all points pairs A; B;, i.e., if (5.11d) holds, the 
obstacle avoidance can be achieved. It is notable that a larger number of key points 
do help to describe the information of the obstacle more clearly, but it would lead to 
a computational burden, since the number of inequality constraints also increases. 
Therefore, the distance of the key points on the obstacle can be selected similar to 
those of the manipulator. 


5.3 RNN Based Controller Design 


In previous parts, we have transform the compliance control as well as obstacle 
avoidance problem into a constraint-optimization one. However, because that the 
QP problem described in Eq. (5.11) contains equality and inequality constraints, 
moreover, both Eq. (5.1 1b, d) are nonlinear, it is difficult to solve directly, especially 
in industrial applications in realtime. Based on the parallel computation ability, an 
RNN is established to solve Eq. (5.11) online, and the stability of the closed-loop 
system is also discussed. 
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5.3.1 RNN Design 


In terms with the QP problem Eq. (5.11), although the analytical solution can be 
hardly obtained, by defining a Lagrange function as 


L = |l +AT (KIF — K,Kj7' Ax + ta — J(0)8) +A3(Jo6 — B), (5.12) 


where à; and à% are state variables, respectively. According to Karush-Kuhn-Tucker 
(KKT) conditions, the inherent solution of Eq. (5.11) satisfies 


6 = Po — 2%) (5.13a) 

= = 77h” .l3a 
ae að 

JO = K} 'F — KyKj'Ax + Xa, (5.13b) 

Ap = (2 + Jo — B)”, (5.13c) 


where Pp (x) = argminy<¢||y — x|| is a projection operator of Ê to convex 92, and 

Q = {6 € R"|max(a(9~ — 0), 87) < 6 < min(6+, «(6+ — 6))}. In Eq. (5.13c), the 

operation function (e)* is defined as a mapping to the non-negative space. Equa- 
tion(5.13c) can be rewritten as 

à2>0 if J,6=B, 

p if Jĝ < B, (IA 


When Jô < B, the inequality Eq. (5.11d) holds, then àz stays zero. Instead, if 
the inequality reaches a critical state, A2 becomes positive to ensure Jĝ = B. In 
order to obtain the inherent solution in real time, a recurrent neural network is built 
as follows 


eð = — + P96 —6/|6|[5 + Ja, — JEM), (5.15a) 
eh, = K} 'F — K,Kj'Ax + Xa — J(0)0, (5.15b) 
tho = —ho + (Ao + Jo — B)”, (5.150) 


with £ being a positive constant scaling the convergence of Eq. (5.15). 

The proposed RNN based algorithm is shown in Algorithm 5.3.1. Based on escape 
velocity method, the convex set of joint speed can be obtained based on the positive 
constant œ and physical constraints 07, 0+, 7, 0~. After initializing state variables 
A; and A», the reference velocity can be obtained based on the desired command and 
actual responses according to Eq. (5.4) then the output of RNN (which is also the 
control command) can be calculated based on Eq. (5.15a), at the same time, both A, 
and A» can be updated according to Eqs. (5.15b) and (5.15c). 

In real applications, the nonlinear system can be hardly approximated completely. 
Therefore, the approximate error is inevitable, which would influence the perfor- 
mance of the proposed controller. However, the approximate error is a small value 
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Algorithm 4 Collision-Free position-force controller based on RNN 


Input: Positive control gains a, £, and interaction coefficients Kp, K4. Initial states qg(0) = 0, 4 (0), 
desired path xa(t), ta(¢) and operation force F4(t), task duration Ts, feedback of end effector’s 
coordination x(t) and contact force F, joint angles 0, Jacobian matrix J(@), information of the 
obstacles B; and Bj = 1,--- , b. Location of key points A;, i = 1,--- , a on the robot, and the 
corresponding Jacobian matrices J4;. Physical limitations 07, 0+, Ò`, 6+. Safety distance d. 

Output: To achieve position-force control without colliding with obstacles 

Initialize 4; = 0, à2 = 0. 

x,q, F, 6 < Sensor readings 

Calculate x4;, 4; and Ja; by Equation (5.7) 

Calculate matrices Jo, B by Equation (5.1 1d) 

Update upper and lower bounds of joint velocities by Equation (5.1 1c) 

Update output of RNN (joint velocity) by Ë using Equation (5.15a) 

Update 41 by À; using Equation (5.15b) 

; Update A2 by Az using Equation (5.15c) 

Until(t > Te) 


SN AARWN DS 


of higher order, and the influence can be suppressed based on the negative feedback 
scheme in the outer-loop, as shown in Eq. (5.4). 


Remark 5.3 The output dynamics of the proposed RNN is given in Eq. (5.15a), 
in which the projection operator Pg(e) plays an important rule in handling physi- 
cal constraints Eq. (5.11c), the updating of 6 depends on three parts: the first part 
—Å /| Il | in used to optimize the objective function | lô] [es and the second item JTA; 
guarantees the equality constraint Eq. (5.11b) by adjusting the dual state variable à; 
according to Eq. (5.15b), and the last item — JT A, ensures the inequality constraint 
Eq. (5.11d). The RNN consists of three kinds of nodes, namely, 6, i, and A>, with 
the number of neurons being n + ab + m. 


It is remarkable that the proposed controller is based on the information of system 
models such as J, Jo, Kp, etc., which is helpful to reduce computational cost. As 
to the constraint-optimization problem Eq. (5.11), the main challenge is to solve it 
in real-time, since the parameters in constraints Eqs. (5.11b) and (5.11d) are time 
varying. From Eq. (5.15), the control effort is obtained by calculating its updating 
law, which is based on the historical data and model information, i.e., it is no longer 
necessary to solve the solution of Eq. (5.11) as every step, and the computational 
cost is thus reduced. In the following section, we will also show the convergence of 
the RNN based controller. 

In this chapter, we mainly concern the obstacle avoidance problem in force control 
tasks. It is notable that force control is mainly based on the idea of impedance control 
theory, which is similar to existing methods in [35, 36]. The main challenge of the 
proposed control scheme lies in the limitation of sampling ability of cameras, which 
are used to capture the obstacles. To handle the measurement noise or disturbances, 
a larger safety distance d can be introduced to ensure the performance of obstacle 
avoidance. 
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5.3.2 Stability Analysis 


Lemma 1 (Convergence for a class of neural networks) [37] A dynamic neural 
network is said to converge to its equilibrium point if it satisfies 


KX = —X+ Ps(x — OF (x)), (5.16) 


where k > 0 and ọ > 0 are constant parameters, and Ps = argmin,-5||y — x|| is a 
projection operator to closed set S. 


Definition 1 For a given function F(e) which is continuously differentiable, with 
its gradient defined as VF, if VF + VF Tis positive semi-definite, F (e) is called a 
monotone function. 


About the stability of the closed-loop system, we offer the following theorem. 


Theorem 1 Given the collision-free position-force controller based on a recurrent 
neural network, the RNN will converge to the inherent solution (optimal solution) of 
Eq. (5.11), and the stability of the closed-loop system is also ensured. 


Proof Define a vector £ as £ = (0: Ài; Ao] € Rett, according to Eq.(5.15), the 
time derivative of £ satisfies 


e = —é + Polé — FE), (5.17) 


in which e > 0, and F (4) = [F1 €), Fo(&), F3(&)I", where Fi = 6/||6||3 — J7A1 + 
Joho, Fy = J — K} 'F + K,Kj' Ax — xa, Fs = —J,6 + B. By calculating the 
gradient of F (£), we have 


I/18} =J" JE 
VF(é) = J 0 0 (5.18) 
-JTI 0 0 


It is obviously that V F (£) is positive definite. According to Definition 1, F(&) is a 
monotone function. From the description of (5.17), the projection operator Ps can 
be formulated as Ps = [P2; Pr; Pa], in which Pog is defined in (5.13a), Pr can be 
regarded as a projection operator of A, to R, with the upper and lower bounds being 
too, and P, = (e)* is a special projection operator to closed set R2., Therefore, 
Ps is a projection operator to closed set [2; R”; R2]. Based on Lemma 5.1, the 
proposed neural network (5.15) is stable and will globally converge to the optimal 
solution of (5.11). 

Notable that the equality constraint (5.11b) describes the impedance controller, 
and the convergence can be found in [38]. Similarly, the establishment of inequal- 
ity constraint enables obstacle avoidance during the whole process. The proof is 
completed. 
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Remark 5.4 It is remarkable that the original impedance controller described in 
(5.11b) bears similar with traditional methods in [39] the main contribution of the 
proposed controller is that the controller can not only realize the force control, but also 
realize the obstacle avoidance, besides, the control strategy is capable of handling 
inequality constraints, including joint angles and velocities. 


5.4 Numerical Results 


In this part, a series of numerical simulations are carried out to verify the effectiveness 
of the proposed control scheme. First, the pure force control experiment is carried 
out to show the effectiveness of the force controller, and then the control scheme 
is further verified by testing the system response after the introduction of obstacles. 
Then, we examine the control performance in more general cases, including model 
uncertainty and multiple obstacles. 


5.4.1 Simulation Settings 


First of all, the planar robot used in the simulation is the same as the previous 
chapters. It is worth noting that in the force control task, the final actuator needs to 
keep contact with the workpiece, so it is necessary to distinguish between necessary 
contact and unnecessary collision. In this chapter, the proposed controller can handle 
this problem by properly selecting key points. As a result, the final effector is not 
considered critical in order to be in contact with an obstacle (or external environment). 
In order to avoid obstacles, the set of key points of the robot is defined as Aj, --- , A7, 
in which Aj, A3, As and A; locate at the center of the links, and A», A4 and A¢ are 
defined to be at J2, J3 and J4. The lower and upper bounds of joint angles and 
joint velocities are defined as 6; = —3rad, a = 3rad, J = — rad/s, åt = lrad/s 
fori = 1...4, respectively. The safety margin is selected as 0.01m. The coefficients 
describing the contact force are selected as Ky = 50, K, = 5000. For simplicity, let 
bo = K} 'F — K)Kj Ax + ža. 


5.4.2 Force Control Without Obstacles 


First of all, an ideal case where there is no obstacles in the workspace is considered, 
and the parameters K4 and K „ are assumed to be known. The robot is wished to offer 
a constant contact force on a given plane. The contact force is set to be 20 N, while the 
direction of contact force is aligned with the y-axis of the tool coordination system. In 
this example, the y-axis of is [1, —1]" in the base coordination. The pre-defined path 
on the contact plane is xg = [0.4 + 0.lcos(0.5t), 0.5 + 0.1cos(0.5t)]. The initial 
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Fig. 5.1 Numerical results of compliance control without obstacles. a is the robot's tracking path 
and the corresponding joint configurations. b is the profile of position error along the free-motion 
direction. ¢ is the profile of contact force. d is the profile of iô 


state of the robot system is set as 0) = [1.57, —0.628, —0.524, 0.524]Trad, Gp = 
[0, 0, 0, O]' rad/s. The control gains of the proposed RNN controller are a = 8,¢ = 
0.02, respectively. Numerical results are shown in Fig.5.1. The tracking error along 
the contact plane is given in Fig. 5. 1b, the transient is about 1 s. At the beginning stage, 
since the end-effector is not in contact with the surface, the contact force stays zero 
before 0.5 s. As the end-effector approaches the surface, the contact force converges 
to 20 N, showing the convergence of both positional and force errors. The Euclidean 
norm of joint velocities (which is also output of the established RNN) is shown in 
Fig.5.1d, |||| changes periodically, with the same cycle as the expected trajectory. 
The time history of the end-effector’s motion trajectory and the corresponding joint 
configurations are shown in Fig. 5.la, in which the red arrow indicates the direction 
of the contact force, and the blue arrow shows the direction of the end-effector s 
free-motion. All in all, the proposed controller can achieve the position-force control 
precisely. 


5.4.3 Force Control with Single Obstacles 


In this chapter, a stick obstacle is introduced into the workspace, which is defined 


as x = —0.05 m. The initial states and expected values of xg, Fa are the same as 
Chap. 5.4.2. 
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Fig. 5.2 Control performance of the proposed controller while avoiding a wall obstacle. a is the 
robot's tracking path and the corresponding joint configurations. b is the profile of position error 
along the free-motion direction. ¢ is the profile of contact force. d is the profile of joint angles, r 
is the profile of joint velocities. f is the profile of the closest distance to the obstacle of each key 
points Aj,i =1,---,7 


Remark 5.5 In Eq. (5.10), we have shown the basic idea of calculating the distance 
between the robot and obstacles, i.e., by abstracting key points form the robot and 
obstacles, the distances can be the robot and obstacle can be described approximately 
at a set of point-to-point distances. In this example, the distance can be obtained in a 


simpler way. However, the obstacle avoidance strategy is essentially consistent with 
Eq. (5.10). 


Simulation results are given in Figs.5.2 and 5.3. The output of RNN is shown 
in Fig. 5.2e, when simulation begins, Ê reaches its maximum value, driving the end- 
effector to move towards the desired path. And then the robot slows down quickly 
(after t © 0.5s), the robot move smoothly, as a result, the position error successfully 
converges to 0, and simultaneously, the contact force converges to 20 N. It is notable 
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Fig. 5.3 Simulation results of the established RNN while avoiding a wall obstacle. a is the profile of 
A,.b is the profile of 4. ¢ is the profile of || J 6- bo| |2: d is the profiles of the desired and reference 
trajectory along x-axis. e is the profiles of the desired and reference trajectory along y-axis. f is the 
profiles of the objective function of the proposed controller and JPMI based method 


that at t = 1.2 s, the key point Az of the robot gets close to the obstacle, as shown 
in Fig. 5.2f. Based on the obstacle avoidance strategy Eq. (5.15c), the state variable 
A2(2) becomes positive, and then the output of the RNN varies with Az (Fig. 5.3b). 
Correspondingly, an error (about 1 x 10~*m) occurs in the positional tracking, and 
so as the contact force (force error is about 2N). However, the RNN converges 
to the new equilibrium point(since the equilibrium point would change when the 
inequality constraint works), and both e, and ey converges to 0. By comparing 
Figs.5.2a and 5.1a, after introducing the obstacle, the robot is capable of adjusting 
its joint configuration to avoid the obstacle. The distances between the key points 
A, — A; to the obstacle are shown in Fig. 5.2d, a minimum value of about 0.01m is 
ensured during the whole process. Using impedance model, the force control problem 
is transferred into a kinematic control one by modifying the reference speed Eq.(5.4). 
Consequently, the resulting trajectory x, together with xq are as shown in Fig. 5.3d, e. 
As an important index in the proposed control scheme, the norm of joint speed | \6| f 
is wished as small as possible. Therefore, we introduce a comparative simulation, 
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in which the solution is obtained based on pseudo-inverse of Jacobian matrix and 
physical limitations are not considered. Comparative curves of the objective functions 
are as shown in Fig.5.3f. The RNN based controller can optimize the objective 
function, it is remarkable that a difference appears at about t = 1.2 — 5 s, which 
is mainly caused by obstacle avoidance (which is not considered in JMPI based 
method). Since the output of RNN 6 is used to approximate the reference speed bo, 
the approximate error ||J 6 — bo | E is shown in 5.3c, demonstrating the effectiveness 
of the established RNN. 


5.4.4 Force Control with Uncertain Parameters 


In this example, we check the control performance of the proposed control scheme 
in presence of model uncertainties. Similar with previous simulations, the ini- 
tial states of the robot are also 6) = [1.57, —0.628, —0.524, —0.524]' rad, ĝo = 
[0, 0, 0, O]' rad/s. In real implementations, the interaction model is usually unknown, 
and the nominal values of K4 and K, are not accurate. Without loss of generality, 


we select the nominal values of K4 and K, as Ka = 80, K p = 4000, respectively. 
In order to handle model uncertainties in the interaction coefficients, an extra node 
is introduced into (5.15). Then the modified RNN can be formulated as 


eË =—6 + Po(6 — 6/||O3 + ITAL — JZ), 
eA, = Ki Ire K, Kł l Ax + ža — J(0)6, 
elz = —Ar + (Ao + Jo — B)*, 

W = —Kinn(Fa— FY", 


in which W = [K,; Ka], n = [x — xa; x — xq], and the positive coefficient K;„ scal- 
ing the updating rate is defined as Ki, = diag(500, 20). Simulation results are shown 
in Fig. 5.4 and Fig. 5.5. Although the exact values of K4 and K, are unknown, the 
closed-loop system is still stable, which can be shown from the convergence of track- 
ing error e, and contact force F in Figs.5.4a and 5.4b. The change curves of joint 
angles and joint velocities with respect to time are shown in Fig.5.4c, d, in which 
the bounded-ness of joint angles and velocities are guaranteed. The observed inter- 
action coefficients Ky and K, are shown in Fig. 5.4e, indicating that both Kq and 
K p converge to their real values. Figure 5.5a shows the distances between the key 
points and the obstacle, it is obvious that all key points keep at a safe distance from 
the obstacle (the closest key point is Az). Euclidean norm of bọ — J 4 is illustrated in 
Fig. 5.5c, despite fluctuation occurs at about t = 1.5s, the proposed controller could 
handle model uncertainties. The impedance model based reference trajectory and 
the original desired trajectory are shown in Fig. 5.5d and Fig. 5.5e. Although x, and 
xq are different, the tracking error ey along the direction of free motion and force 
error er converges to zero, as shown in Fig. 5.4a, b. The objective function ||| F to 
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Fig. 5.4 Control performance of the proposed controller while avoiding a wall obstacle with uncer- 
tain K, and Ka. a is the robot's tracking path and the corresponding joint configurations. b is the 
profile of position error along the free-motion direction. ¢ is the profile of contact force. d is the 
profile of joint angles. e is the profile of joint velocities. f is the profile of the closest distance to the 
obstacle of each key points A;, i = 1,...,7 


be optimized is given in Fig. 5.5f. The convergence of the established RNN is shown 
in Fig.5.5c, despite the uncertain parameters, using the adaptive updating law, the 
established RNN is capable of learning the optimal solution. The spikes are mainly 
because of the change of A2 when obstacle avoidance scheme is activated. 


5.4.5 Manipulation in Narrow Space 


In this part, we discuss a more general case of motion-force control task, in which the 
workspace is defined in a limited narrow space. The robot is limited by two parallel 
lines, namely, yı = 0.15 and y} = —0.15 m. Considering the safety distance, all 
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Fig. 5.5 Simulation results of the established RNN while avoiding a wall obstacle with uncertain 
K, and K4. ais the profile of 4. b is the profile of 2. ¢ is the profile of || JO — bo| B: d is the profiles 
of the desired and reference trajectory along x-axis. e is the profiles of the desired and reference 
trajectory along y-axis. f is the profiles of the objective function of the proposed controller and 
JPMI based method 


key points except Ag must satisfy the workspace description —0.14 < y < 0.14m. 
The initial joint angles are set to be 6) = [0.393, — 1.05, 1.57, —0.785]! rad, and 
ôo = [0, 0, 0, OJ rad/s. The desired path is selected as xg = [0.8 + 0.1cos(0.5t), — 
0.15]'m, and the expected contact force is Fy = 20 N, with the direction vector 
being [0, —1]’. Simulation results are given in Figs.5.6 and 5.7. When simulation 
begins, the initial position error is about 0.1 m, and the converges to zero, with the 
transient being about 0.5 s. Simultaneously, the contact force also converges to 20 N. 
In Fig. 5.7a, minimum distances between the key points to yı and y2 are represented 
by blue and red curves, respectively. The tracking trajectory and the corresponding 
joint configurations are shown in Fig.5.6a. During t = 1 — 1.5s and t = 6 — 13s, 
point A, gets close to yı, during t = 4 — 7s, A4 is close to y2. Remarkable that there 
exist fluctuations in positional and force errors at t = 1s and t = 4s, (i.e., when A2 
and A, get close to the bounds), respectively. Similar to the previous simulations, the 
reference trajectories are given in Fig. 5.5c, d, and the objective functions are shown 
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Fig. 5.6 Control performance of the proposed controller in a narrow workspace. a is the robot's 
tracking path and the corresponding joint configurations. b is the profile of position error along the 
free-motion direction. c) is the profile of contact force, d is the profile of joint angles. (e) is the 
profile of joint velocities. (f) is the profile of the closest distance to the obstacle of each key points 
Aj,i=l,---,7 


in Fig. 5.5e. Using the proposed RNN controller, the robot can realize both position 
and force control in limited narrow space. 


5.4.6 Comparisons 


In this part, comparisons among the proposed control scheme and existing methods 
are given to show the superiority of the RNN based strategy. The comparisons are 
shown in Table 5.1. In [22], an RNN based controller is designed for redundant manip- 
ulators, both obstacle avoidance and physical constraints are considered. However, 
the controller only focus on kinematic control problem. In [40] and [16], force control 
together with obstacle avoidance are taken into account, but the physical constraints 
are ignored. [23] develop an adaptive admittance control strategy, which is capable 
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Fig. 5.7 Simulation results of the established RNN in a narrow workspace. a is the profile of A1, 
b is the profile of A2, ¢ is the profiles of the desired and reference trajectory along x-axis. d is the 
profiles of the desired and reference trajectory along y-axis. e is the profiles of the objective function 
of the proposed controller and JPMI based method 


Table 5.1 Comparisons among The Proposed Controller and Existing Methods 


Method Provable Optimize in | Physical Force versus | Collison 
convergence | realtime limitations kinematic avoidance 

control 

This chapter | Yes Yes Considered Force control | Yes 

Method in Yes Yes Considered kinematic Yes 

[22] control 

Method in Yes Yes Ignored Force control | Yes 

[40] 

method in [23] | Yes Yes Considered Force control | No 

method in [16] | Yes No Ignored Force control | Yes 
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of dealing with force control under model uncertainties, physical constraints and 
real-time optimization. It is remarkable that the proposed strategy focus on real-time 
obstacle avoidance in force control tasks, and the controller is capable of ensuring 
the boundedness of joint angles and velocities. At the same time, simulations have 
shown the potential of optimization ability of norm of joint speed. 


5.5 Summary 


This chapter constructs a new collision free compliance controller based on the 
concept of QP programming and neural network. Different from the existing methods, 
this chapter describes the control problem from the perspective of optimization, 
taking compliance control and conflict avoidance as equal or inequality constraints. 
Physical constraints, such as joint angle and speed constraints, are also considered. 
Before concluding this chapter, it is worth noting that it is the first RNN based 
compliance control method, which considers collision avoidance in real time and 
shows great potential in dealing with physical constraints. In this chapter, Matlab 
is simulated to verify the efficiency of the controller. In the future, we will check 
the control framework of different impedance models in the physical real simulation 
environment, and then consider the machine vision technology and system delay on 
the physical experiment platform. 
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Chapter 6 A) 
RNN for Motion-Force Control of get 
Redundant Manipulators with Optimal 

Joint Torque 


Abstract Precise position force control is the core and difficulty of robot technology, 
especially for robots with redundant degrees of freedom. For example, track-based 
controls often fail to grind the robot due to the intolerable impact force applied 
to the end-effector. The main difficulties lie in the coupling of motion and contact 
forces, redundancy analysis and physical constraints. In this chapter, we propose a 
new motion force control strategy under the framework of recursive neural network. 
The tracking error and contact force are described respectively in the orthogonal 
space. By choosing the minimum joint torque as the secondary task, the control 
problem is transformed into the QP problem under multi-constraint conditions. In 
order to obtain real-time optimization of the joint torque relative to the non-convex 
joint angle, the original QP is reconstructed at the velocity level, and the original 
objective function is replaced by the time derivative. Then a convergent dynamic 
neural network is established to solve the improved QP problem online. The robot 
position control based on recursive neural network is extended to the robot position 
control based on position force, which opens a new way for the robot to turn from 
simple control angle to crossover design with convergence and optimality. Numerical 
results show that the proposed method can realize precise position force control, deal 
with inequality constraints such as joint angular velocity and torque limitation, and 
reduce joint torque consumption by 16% on average. 


6.1 Introduction 


Redundant manipulators, which have more DOFs than those required to complete a 
given task, are more flexible than non-redundant ones. The redundant DOFs enable 
manipulators to realize the fault tolerant control, improve operation performance 
and enhance reliability. Therefore, redundant manipulators have been widely used 
in industry, agriculture, military, space exploration, etc. Consequently, the research 
on the redundant manipulator has been studied intensively [1—4]. 

Motion control and force control are two main modes of redundant manipulator 
control. In motion control problems, a basic assumption is that there is no contact 
between the robot and the environment, that is, the robot can move freely in the 
workspace [5]. This problem is well-reflected in coating, welding, stacking, stacking 
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and other applications. Then the core problem is to design control commands that 
drive the robot to follow a predetermined trajectory. The control command may be 
joint angular sequence [6], velocity sequence [7], acceleration sequence [8, 9] or 
torque sequences [10-12]. The redundancy resolution is usually used to achieve a 
secondary task, such as avoiding obstacles [13], avoiding singularities [14], etc. Dif- 
ferent from motion control, force control involves the direct interaction between a 
robot and its environment. The control of contact force can enhance the robustness 
and flexibility of the robot in the weak structure environment, so as to enhance the 
robot's operation ability [15]. Corresponding typical applications in polishing, grind- 
ing, assembly, polishing, polishing and other fields [16, 17]. In [18], the theoretical 
framework of impedance control is proposed. The basic idea is to use the environment 
as admittance and the robot as impedance.By maintaining the dynamic relationship 
between force and motion, the controller is represented as a spring-mass-damping 
system. In [19], a hybrid position force controller is proposed which combines posi- 
tion information with force information to realize the simultaneous control of position 
constraint and force constraint. Based on these two control frameworks, a series of 
controllers are proposed and verified by simulation or experiments [20-22]. 

Although the above work has achieved great success in motion force control 
of non-redundant robots, the control of redundant robots has not attracted enough 
attention. It is worth noting that the redundancy of the manipulator provides an 
opportunity to meet secondary objectives, but also sets up mathematical difficulties. 
In [23], in order to realize the flexible control of unknown obstacles, a position 
force control strategy is proposed. The motion of the robot is completely decoupled 
into two parts, namely the motion of the end-effector and the internal motion. The 
motion of the end-effector is controlled to achieve positional force control over the 
environment, while the internal motion is designed to avoid obstacles by minimizing 
impact. In [24], a robust control strategy with the ability to adjust contact force and 
apparent impedance is designed. The controller has strong robustness in dynamic 
and kinematic uncertainties. In [25], Patel et al. proposed a hybrid impedance control 
scheme based on pseudo inverse jacobian. The limitation of joint angle is avoided by 
defining a function that scales the difference between joint angle and its boundary. 
However, these literatures require continuous computation of the pseudo inverse of 
the jacobian matrix, which brings huge computational burden and is difficult to deal 
with multiple constraints [26]. 

In order to solve the redundant solution problem of redundant robots, a feasi- 
ble method is to transform the control problem into an optimization problem under 
constraints [27]. The objective function is established according to the secondary 
task, and the constraint conditions are established according to the primary task and 
physical constraints. This optimization problem is often described as a QP problem 
[28]. Because of the high efficiency of parallel computing, recursive neural network 
is often used to solve QP-based redundant solutions online [29]. In recent years, the 
controller based on recursive neural network (RNN) has been introduced into the 
motion control of redundant robots. A new redundancy decomposition method is 
proposed, which constructs a robust neural network at the velocity level to guarantee 
the boundary of joint acceleration [30]. In [31], RNN is improved to allow projection 
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operations on non-convex sets, avoiding the accumulation of tracking errors to sys- 
tem noise. In [32], a method is proposed to realize operational optimization through 
indirect maximization of time derivative. In [33], cooperative control of distributed 
multi-robot is studied. Recently, RNN has been extended to control flexible robots, 
model uncertainties and other issues [34-37]. Although the motion control of redun- 
dant robot based on RNN has obtained good research results, as far as we know, there 
is no research report on the application of RNN in motion force control of robot. 

On this basis, we propose a position force control scheme based on RNN, which 
is an important extension of recursive neural network in robot control. Table 6.1 pro- 
vides a brief comparison between the proposed and existing programmes. Unlike 
[25] and [23], in this chapter the motion force controller is established in the joint 
velocity stage and allows for multiple inequality constraints. The non-convex opti- 
mization problem is studied without loss of generality. With [31, 32, 34] the existing 
similar motion controller, the proposed motion force controller is no longer needed 
the pseudo inverse Jacobian. 

This chapter mainly studies the following aspects. In the second part, the tracking 
error and contact force are modeled, and the control problem is written as QP prob- 
lem. In Chap. 3, QP is reformulated at the velocity level by rewriting the objective 
function and constraints. In the fourth part, we set up an RNN to solve the redun- 
dant resolution problem. Stability has also been demonstrated. In the fifth part, the 
numerical experiments of 4-DOF planar manipulator are carried out. Finally, the 
sixth part carries on the summary to the full text. By the end of this section, the main 
contributions are as follows: 


e As far as we know, this is the first research to study the motion force control of 
redundant robots in the framework of RNNs. It is worth noting that force sensitive 
manipulators, such as milling robots and polishing robots, cannot successfully 
control the use of existing results in RNNs, while the RNN model built in this 
work can do this. 

e Inthe proposed control scheme, the motion-force control problem as well as redun- 
dancy resolution problem are reconstructed to facilitate practical implementations. 

e The controller is capable of handling multiple inequality constraints, including but 
not limited to angle constraints, angular velocity constraints and torque constraints. 
This is of great significance in improving system security. 

e The contribution of this chapter also lies in the realization of real-time optimization 
of joint torque, which is helpful to save energy in industrial applications. 


6.2 Problem Formulation 


6.2.1 Problem Formulation 


In this chapter, we focus on position-force control problem for redundant manipula- 
tors. Figure 6.1 gives a brief introduction of a redundant robot and its operation on an 
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Table 6.1 Comparisons among the Proposed Motion-force Control Scheme and Existing Ones 


Nonconvex Motion-force | Control Pseudo- Handling 
inverse Multiple 
Control Command Required Inequality 
Constraints 


This chapter | No Yes Joint Velocity | No Yes 
Methods [31, | No No Joint Velocity | No Yes 

32] 

Method [25] | + Yes Joint Torque | Yes No 

Method [23] | + Yes Joint Torque | Yes Restrictivet 
Methods [34] | Yes No Joint Velocity | No Yes 


f In [25] and [23], no projection operations are introduced in those control strategies 
+ In [23], only some certain kinds of inequalities can be handled by [23] 


Fig. 6.1 A brief diagram of 
a planar redundant 
manipulator and its operation 
on a workpiece 


workpiece. The robot is expected to offer a desired contact force in the vertical direc- 
tion of the contact surface, at the same time, the end-effector is required to track a pre- 
defined trajectory along the surface. In the base coordinate frame Ro (00, xo, Yo, Zo), 
forward kinematics of a serial manipulator can be written as 
FOE) =x), (6.1) 
where 0 € R” is the vector of joint angles, and x € R” represents the end-effector’s 
coordinate vector in frame Ro, f(e) : R” —> R” is used to describe the forward 
kinematics operator. For a redundant manipulator, we have n > m. 
By differentiating x(t) with respect to time t, we can get the relationship between 
Cartesian velocity x(t) € R” and joint velocity (or joint control signal) 6(t) € R” 
as follows: 


JOMO = X(t), (6.2) 


where J (0(t)) = 0 f (0 (t))/90 (t) is called Jacobian matrix. 
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In position-force control tasks, the end-effector’s motion is constrained by the 
contact surface. For simplicity, we define a tool coordinate system as R;(x;, Yr, Zt), 
in which the axis z, is set in alignment with the vertical direction of the contact 
surface. Obviously, the motion of end-effector can be specified along x; and yy. 
In this chapter, frictional force between the robot and contact surface is ignored, 
therefore, the contact force F is in alignment with zz. 

In the tool coordinate system R,, let 5X, be the displacement between effector 
and its position command, then the contact force F, can be formulated as 


F, = k; D,8X,, (6.3) 


where ky > 0 is the stiffness coefficient, X, = diag(0, 0, 1). Diagonal matrix X, 
describes the relationship between the contact force and relative displacement along 
different axes: 1 means that displacement component along z; affects the contact 
force, and 0 otherwise. 
Similarly, in tool coordinate system R,, the position tracking error e, can be written 
as 
e, = 55k. (6.4) 


where X, = I — ¥șş = diag(1, 1,0), 1 means there is a DOF of movement along 
the corresponding direction, and 0 otherwise. 

When the contact surface is prior known, R, can be obtained from Ro by a rotation 
matrix S,. Let F, eg and 6X be the corresponding description of F,, e, and 6X, in 
coordinate frame Ro, then we have F = SI Fy, e, = S eoand êX, = S,ô X. Therefore, 
F and eg can be rewritten as 


F=k,;S'S,S,6X, 
| ees (6.5) 


eo = SES 5X. 


Notable that in frame Ro, the displacement 6X can be described as 6X = x — xq, 

where xq is the desired position signals described in Ro. Using (6.1), (6.5) can be 

rewritten as 

F =kyS)Z5S,(f @) — xa), (66) 
eo = ST ZpS (f (0) — xa). l 


Remark 6.1 Equation (6.6) gives the unified description of the relationship between 
the contact force F, position tracking error eọ and displacement 6X in Ro. ôX will 
lead to contact force F in the vertical direction, and position tracking error eg along 
the contact surface. 


In real implementations, given the desired contact force F, and trajectory com- 
mand xq, the manipulator’s end-effector is expected to offer contact force Fy while 
tracking xq, i.e., F — Fa, eo — 0. For the convenience of writing in the following 
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sections, let A = [k SF X; S1; STE p S:], r = LFT, ef)", and ra = [ Fu; 0]. Then (6.6) 
can be reformulated as 
A(f (9) — xa) =r. (6.7) 


Therefore, the control objective of position-force control is to adjust the joint 
angles 0, to ensure r > ra. 


6.2.2 Joint Torque and Physical Constraints 


When the end-effector offers a contact force F, the corresponding torque is provided 
by motors at every joint. The relationship between contact force F and the joint 
torque t can be formulated as 


t = J'()F. (6.8) 


In the control of redundant manipulators, there would be infinite groups of solu- 
tions to a certain control task. In order to save energy during the control process, 
we select a objective function scaling energy consumption as t't/2. The smaller 
t'r/2, the less energy consumption. 

In real implementations, the system is limited by physical constraints. For exam- 
ple, the joint angles 0 and velocities Ê must not exceed their limits: @™™",9™*, ġmin, 
6™, since the possible collisions or overheating of motor would lead to irreversible 
damages. At the same time, considering the bounded torque output of the motors, 
the limitation of joint torque t is described as t™" < t < t™™, 


6.2.3 Optimization Problem Formulation 


According to the descriptions above, the position-force control problem for redundant 
manipulators considering torque optimization can be formulated as 


min G, =t'1/2, (6.9a) 
s.t. t=J'F, (6.9b) 
ra = A( f (0) — xa), (6.9c) 

Pe shag. (6.9d) 

min < Å < gm, (6.9e) 
pega. (6.9f) 


with 6 being the decision variable. Equation (6.9a) is the cost function to be mini- 
mized, the equality constraint (6.9b) describes the relationship between the resulting 
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joint torque t and contact force F. The force and motion tasks of the robot are 
described in (6.9c), and inequality constraints (6.9e), (6.9d) and (6.9f) show the 
physical limitations to be satisfied. By substituting (6.9b) into (6.9a), the optimiza- 
tion problem can be rewritten as 


min G, = F'J(0)J'(0)F/2, (6.10a) 
s.t. ra= A(f (0) — xa), (6.10b) 
gmin < o < 9mm (6.10c) 
gmin < § < 9™™, (6.10d) 
e (6.10e) 


To solve (6.10), there are two main challenges. Firstly, as an objective function to 
be minimized, FT J(0)JT (0) F /2 is usually non-convex relative to 6, because it is a 
function of J (0). Secondly, the equation constrain (6.10b) is highly nonlinear, and 
at the same time, it remains difficult to handle the inequality constraints, especially 
(6.10d) and (6.10e). 


6.3 Reconstruction of Optimization Problem 


In this section, in order to overcome the above difficulties, the redundancy resolution 
problem (6.10) will be reconstructed. The objective function is firstly redefined, and 
both equality and inequality constrains are rebuilt in velocity level. 


6.3.1 Reconstruction of Objective Function 


As to F'J(6)J™(6)F /2, we will replace F with the desired value Fy. Therefore, the 
optimization function can be formulated as G3 = Fj J (0) J" (6) Fa/2. 


Remark 6.2 There are two main reasons: firstly, according to the control objective, 
the contact force F is expected to track Fy, if the controller is proper designed, F 
will eventually converge to Fy, consequently, FT J (0) J" (6) Fa/2 will be equivalent 
to FTJ(0)JT(0)F/2. Secondly, Fa is independent of 0, this replacement will reduce 
the computational complexity in the control process. 


Differentiating G2 with respect to time, we have 


G2 = (J"(0) Fa)" 


TF 
UU” LE (6.11) 
dt 


Obviously, G» describes the change of G2. By minimizing Go, the system will be 
compelled to develop in the direction of decreasing G2. Therefore, in this chapter, 
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we use Go instead of G2 as the new objective function. Notable that d(J™ (0) Fy) /dt 
can be formulated as 


d = (ITO) Fa) ; ; 

— (JT (0) Fj) = TA HJE, 

qt Fa) 2, T + JT) Fa ih 
= [Hj,---, H ]ô + J" (0) Fa, 


where H; € R” is 


Dra OIG, DFaG))/30;) 
_ A(T @)Fa) _ | Vij OG, DFaG))/30:) 


Hi; 
00; aw 
Vn OIG, n) Fa(7))/96) 
Let H = [HM,--- , Hy], then (6.11) can be converted as 
Gy = FIJ HÈ + FUJI" Fa. (6.13) 


It is worth pointing that the second term of (6.13) is independent of 6, therefore, the 
objective function is equivalent to FT J H9. 


6.3.2 Reconstruction of Constraints 


In this part, we will transform the constrains into velocity level. First of all, we 
define a concatenated vector describing force and position errors as e = f — ra = 
[F — Fa; eo], according to (6.7), e can be formulated as 


e = A(f (0) — xa) — ra. (6.14) 
Differentiating e and combing (6.2) yields 
é = A(JO — ža) — Fa. (6.15) 


To ensure that e converges to zero, a simple controller can be designed as é = —ke, 
where k > Ois a positive constant. According to (6.14), (6.15), the equality constrains 
can be converted in velocity level as 


AJO = rg + Akg — k(Af (0) — xa). (6.16) 

As to the inequality constraints (6.10c) and (6.10d), according to [27], let œ = 6 

and define œ > 0 as a constant parameter to scale the negative feedback to conform 
the joint constraints, these two constraints can be formulated in the speed level as 
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mn <o< on™, (6.17) 

where w™" = max{a(6™" — 6), 8™"}, and o™™* = minfa (0™* — 0), A™}, 
Similarly, (6.10e) can be built indirectly by limiting its derivative: B(t™" — t) < 
t < B(t™* — t), where £ is a positive constant. By combing (6.12), the boundedness 
of joint torque can be rewritten as an inequality constraint about a function g(@) as 


g(@) <0, (6.18) 


where g(w) = [B(ct™" — t) — JT Fy — Ho, Ho — B(t™ — t) JT + Fa]! € R”. 


6.3.3 Reformulation and Convexification 


According to the above description, in order to achieve position-force control of 
redundant manipulators, instead of solving (6.10) directly, one feasible solution is to 
solve the optimization problem in velocity level as 


min Fi JHo, (6.19a) 
st 7r, = AJo, (6.19b) 
gœ) < 0, (6.19c) 

we Q, (6.19d) 


where r, = rg + Axa — K(Af (0) — xa), 2 = {w € R” jomin < wi < wp” } is a con- 
vex set. It is remarkable that the objective function described in (6.19a) is non-convex 
relative to w. Therefore, (6.19b) is introduced to convexity (6.19a). The final form 
of optimization problem is described as 


min Fi JHo+t+ (AJa@ —1r,)'(AJ@ —r,), (6.20a) 
s.t. r, = AJo, (6.20b) 
g(@) < 0, (6.20c) 
wo€E LQ. (6.20d) 


So far, we have reconstructed the position-force control with joint torque optimization 
problem into a quadratic programming issue under constraints. However, the QP 
problem (6.20) cannot be solved directly. 
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6.4 RNN Based Redundancy Resolution 


In this chapter, in order to solve the optimization problem (6.20), an expanded recur- 
rent neural network is built to obtain the optimal solution of (6.20). Stability will be 
also discussed. 


6.4.1 RNN Design 


Firstly, let A; € R?” and Az € R” be dual variables to constraints (6.20b) and (6.20c), 
a Lagrange function is defined as 


L =F] J Ho + (AJo —1,)' (AJo — r,) 
+ ATU, — AJo) +A38(@). (6.21) 


According to Karush—Kuhn—Tucker condition, the optimal solution of the opti- 
mization problem (6.20) can be equivalently formulated as 


OL 
w = Pe(wo- —), (6.22a) 
dw 
r, = AJo, (6.22b) 
Ao = (An + g(o))*, (6.22c) 


where Po(x) = argmin,.9||y — x|| is a projection operation to convex set 2, and 


T = aF, ah) a7 = may, 0). 
In order to solve (6.22), an expanded recurrent neural network is designed as 


cò = — w + Po(w— HJT Fi — JTAT (AJo — r,) 


+ JTA" — Vg), (6.23a) 
ed, =r, — AJo, (6.23b) 
e2 =(h2 — A2 + 8(@))*), (6.23c) 
0 f) m . is 

where Vg = ( A oe, = ) = [-H", HT] € R”*” eis a positive constant scal- 


w wW 
ing the convergence of (6.23). The pseudo code of the RNN-based strategy is shown 
in Algorithm 5. 
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Algorithm 5 The RNN based position-force controller 


Input: Control parameters €, k, œ, stiffness coefficient k f, prior knowledge of the contact surface 
Sı, X; and Y;. The robot’ joint limits 6", gmin and joint speed limits ġmax, gmin, initial joint angles 
6(0), desired tracking trajectory rg(t), ra(t), and contact force Fy. Feedback of actual coordinate 
x(t), contact force F(t), and joint angle 0, task duration T. 
Output: To achieve position-force control of the redundant manipulator and optimize joint torque 
1: Initialize à; (0), 2 (0) 
2: Repeat 
i On-line feedback of F, 6, x < from sensors 
Calculate the weight matrix A 
Calculate the reference command r, < Eq. (6.19) 
Update joint velocity command w by w < Eq. (6.23a) 
Update state variable A, by Ay < Eq. (6.23b) 
; Update state variable à2 by ho < Eq. (6.23c) 
Until (t > T) 


oor: SA Oye te 


6.4.2 Stability Analysis 


In this part, theoretical analysis of stability and convergence of closed-loop system 
using the proposed neural network (6.23). 

First of all, several important definitions and Lammas are presented, which is very 
useful in stability analysis. 


Definition 6.1 A continuously differentiable function F (e) is said to be monotone 
if VF + VF" is positive semi-definite, where V F is the gradient of F (e). 


Lemma 6.1 [38] A dynamic neural network is said to converge to the equilibrium 
point if it satisfies 


Kx = —X + Ps(x — OF (x)), (6.24) 


where k > Qand @ > O are constant parameters, and Ps = argmin,-5||y — x|| is a 
projection operator to closed set S. 


So far, a theorem about the convergence of the redundancy resolution problem 
can be described as follows 


Theorem 6.1 Given the motion-force control problem for redundant manipulators 
with torque optimization under physical constraints as (6.20), the recurrent neural 
network (6.23) is stable and will globally converge to the optimal solution of (6.20). 


Proof Let £ = [w", Af, A3]", the proposed RNN (6.23) can be written as 


e = —§ + Polé — FE)I, (6.25) 
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where F(E) = [Fi (E), Fo(€), F (E)]T e R2"*3", in which 


F; HTJT Fa + JTAT(AJoæ —r,)— JTATà, + Vg 
Fa | = ài— r, +AJæ 
F; —g (w) 


Let VF (é) = 0F /0é, we have 
JTATAJ —J™A™ Vg 
VF()= AJ I O |. (6.26) 
—(Vg)"_ 0 0 


It is remarkable that 


2JTATAJ 0 0 
VEFE) + (VEFE)! = 0 21 0|. (6.27) 
0 0 0 


From Definition 6.1, F (E) is a monotone function of &. 

According to the description of (6.23) and (6.25), Ps can be formulated as P = 
[Po; Pr; Pal, where Pr € R” is a projection operator of A; to set R, with the 
upper and lower bounds being -too. Furthermore, (e)* is a special case of Pa, in 
which A = R?” is the nonnegative quadrant of IR". Therefore, Pg is a projection 
operator to closed set 2. Based on Lemma 6.1, the proposed neural network (6.23) 
is stable and will globally converge to the optimal solution of (6.20). The proof is 
completed. 


6.5 Illustrative Examples 


In this chapter, taking the planar 4-DOF manipulator as an example, numerical cal- 
culation is carried out to verify the effectiveness of the proposed control scheme. 
First, we will check the control performance without joint torque optimization by 
making HTJ" Fy in (6.23a) be 0. Secondly, a dynamic simulation example of joint 
torque optimization is introduced to illustrate the superiority of the control strategy. 
Finally, the adaptability and optimization performance of this method are verified by 
simulation experiments under different initial conditions. 


6.5.1 Simulation Setup 


As shown in Fig. 6.2, a contact surface in the workspace can be described as y = 0, 
the end-effector can move freely along the horizontal axis, and the desired con- 
tact force Fg is aligned with the vertical direction. The stiffness coefficient ky is 


6.5 Illustrative Examples 117 


Fig. 6.2 Physical structure 
of the 4-link robot 
manipulator 


set to be 1000N/mm. Control positive control gains are set as œ = 10, 6B = 10, 
k = 8, £ = 0.005, respectively. Physical constraints of joint angles, velocities and 
torque are defined as gmin — [—2, —2, —2, —2]" rad, 9™* = [2, 2, 2, 2] rad, gmin — 
[—2, —2, —2, —2]" rad/s,6™* = [2, 2, 2, 2]! rad/s, t™ = [—10, —10, —10, —10]" 
Nm, t™ = [10, 10, 10, 10]? Nm, respectively. 


6.5.2 Position-Force Control Without Optimization 


In this part, the robot is controlled to offer a constant contact force on the surface while 
tracking a given trajectory. It is remarkable that joint torque optimization is not inves- 
tigated yet. The initial joint angles are selected as 0) = [1.57, —1.26, —0.52, —0.52]" 
rad. The desired trajectory is defined as xy = [0.25 + 0.1cos(0.5t), 0]", and the con- 
tact force is defined as Fy = [0, —1]’ N. Numerical results are shown in Fig. 6.3. 
When simulation begins (t < 0.5s), the position error is big, and there is no contact 
between the robot and surface. Correspondingly, both contact force and result torque 
are zero. Under the RNN based controller (6.23), joint velocities reach the maximum 
value, the end-effector approaches to the surface rapidly from the initial position, and 
the tracking error converges to zero quickly, the corresponding joint configurations 
are shown in Fig. 6.3a. As the robot approaches the contact surface, the robot slows 
down quickly, and the contact force rises from zero, the then converges to the desired 
value smoothly. In the stable state (t > 2 s), both contact force F and end-effector 
track the desired command, the tracking errors are zero, which means the robot tracks 
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Fig. 6.3 Numerical results when tracking a time varying force command along a trajectory without 
optimization. a Profiles of the end-effector (black dashed line) and the corresponding joint config- 
urations. b Profiles of position error. ¢ Profiles of contact force. d Profiles of joint angles. e Profiles 
of joint velocities. f Profiles of joint torque 


both desired trajectory and force successfully. Correspondingly, joint angles change 
periodically, which enables the robot to achieve dynamic tracking. This will also lead 
to a periodic change in result torque in joint space, as shown in Fig. 6.3f. During the 
whole process, boundary of joint angles, velocities and torque are all ensured. 
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6.5.3 Position-Force Control with Optimization 


In this part, joint torque optimization is introduced to make full use of redundancy 
resolution. The proposed position-force control scheme is firstly validated in a fixed 
point case, and then extended to dynamic cases. 


(1) Position-Force Control on A Fixed Point 

In this simulation, the robot is wished to offer a constant contact force Fy = [0, —10]™ 
at a fixed point xa = [0.3,0]'. The initial values of joint angles are also 0 = 
[1.57, —1.26, —0.52, —0.52]'rad. Numerical results are shown in Fig. 6.4. At the 
beginning of simulation, the robot moves at its maximum speed (2 rad/s), making 
the regulation error converge quickly. Then it slows down as the regulation error 
is small. At t = 0.5 s, robot touches the surface, which leads to the emergence of 
contact force. Using the proposed controller, both control error of motion and force 
converge to zero smoothly. Correspondingly, the Euclidean norm of joint torque 
also converges to a constant value (3.7 N?/m?). From Fig. 6.4e, f, joint angles and 
velocities do not exceed their limits, showing that the proposed scheme could handle 
inequality constraints effectively. To further demonstrate the validity of the optimiza- 
tion scheme, comparative simulations without optimization are also carried out. The 
obtained Euclidean norm of joint torque without optimization is shown as the red 
dashed line (4.3 N?/m? in stable state). After introducing joint torque optimization 
strategy, 16% off of torque consumption is achieved. 


(2) Position-Force Control Along A Straight Line 

Then we check the optimization scheme in dynamic control. Both the desired 
path xq and desired force Fy are time varying. The expected signals are defined 
as xa = [0.25 + 0.1cos(0.5t), 0], Fy = [0, 20 — 2cos(0.5t)]" N, respectively. The 
initial values of joint angles are the same as the previous case. Numerical results 
are shown in Fig.6.5. We also define a index to scale the torque consumption as 
Je = fo ie @l iar. 

When simulation begins, high joint speed ensures the fast convergence of tracking 
error, which is very similar to the previous simulation. After t = 2s, high precision 
trajectory tracking is realized by the control strategy, as well as the contact force. 
Comparative simulation without joint torque optimization is carried out. Figure 6.5d 
shows the comparison of Euclidean norms of joint torque with and without optimiza- 
tion. Correspondingly, J; decreases 16.2% from 142 to 119, showing the validity 
of the proposed scheme. It is notable that all physical constraints are guaranteed. 
Dynamic change of joint configurations is shown in Fig. 6.5a. 


(3) Position-Force Control Along An Arc Surface 

In this part, the end-effector is controlled to track a quarter-circular surface, which 
is centered at [0.3, 0.3]'m with radius 0.2 m, and is provided a constant force of 
10N in the vertical direction. The initial values of joint angles are selected as 6) = 
[1.5708, —0.9851, —1.1714, O]'rad. Numerical results are shown in Fig. 6.6. The 
trajectory of end-effector is shown in Fig. 6.6a, while in Fig. 6.6b, optimization is not 
introduced. The proposed controller enables the robot to achieve precision control 
of both position and force, and at the same time, by adjusting its joint angles, the 
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Fig. 6.4 Numerical results when the robot is controlled to offer constant force at a fixed point. a 
Profiles of the end-effector (black dashed line) and the corresponding joint configurations. b Profiles 
of position error. ¢ Profiles of contact force. d Comparison of Euclidean norm of joint torque with 
and without optimization. e Profiles of joint angles. f Profiles of joint velocities 


joint torque consumption is reduced, i.e., J; decreases 17.6% from 88.1 to 72.6. It 
is remarkable that the physical constraints are also guaranteed. 


(4) Adaptability to Different Initial Settings 
To further illustrate the joint optimization scheme, another fixed-point control is 
presented. Desired signals are set as xq = [0.3, 0]? and Fy = [0, —10]". The initial 
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Fig. 6.5 Numerical results when tracking a time varying force command along a straight line with 
optimization. a Profiles of the end-effector (black curve) and the corresponding joint configurations. 
b Profiles of position error. ¢ Profiles of contact force. d Comparison of Euclidean norm of joint 


torque with and without optimization. e Profiles of joint anges. f Profiles of joint velocities 


values of joint angles is selected as 6) = [1.8850, — 1.8850, —1.2566, O]" rad, con- 
sequently, the corresponding position of the end-effector is exactly the same as xg. As 
shown in Fig. 6.7a, the robot adjusts its posture and stops in final state, while keep- 
ing its end-effector on the fixed point. This phenomenon is similar to the null-space 
movement based on pseudo-inverse methods. However, different with pseudo-inverse 
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Fig. 6.6 Numerical results when tracking a time varying force command along an arc surface with 
optimization. a Time history of the end-effector (black curve) and the corresponding joint config- 
urations with optimization. b Time history of the end-effector (black curve) and the corresponding 
joint configurations without optimization. ¢ Profiles of contact force. d Profiles of position error. 
e Comparison of Euclidean norm of joint torque with and without optimization. f Profiles of joint 
angles 


based method, the RNN based motion-force controller is capable of handling phys- 
ical inequalities, at the same time, joint torque optimization is achieved from 4.3 to 
3.7. Further more, there in no need to calculate pseudo-inverse of Jacobian matrix, 
which will save computing cost effectively. 
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Fig. 6.7 Numerical results when the initial position of end-effector locates on the desired fixed 
point. a Profiles of joint configurations. b Profiles of Euclidean norm of joint torque with and 
without optimization. ¢ Profiles of joint angles. d Profiles of joint velocities 


Finally, a group of verifications for fixed point position-force control with different 
initial joint angles are carried out, the desired signals are the same as the previous 
simulation. As shown in Fig.6.8, although the initial joint angles are different, at 
steady state, the robot reaches the same joint angle, which shows the adaptability of 
the RNN based control strategy. 


6.6 Question and Answer 


Q1: “What’s the complexity of the proposed RNN?” 

Answer: The network is organized in a one-layer architecture, which consists of 
2m + 3n neurons, namely w € R”, A; € R?” and Az € R”. Despite the difference 
between the proposed neural network with traditional recurrent neural networks, from 
both the mathematical description Eq. (23) and the architecture, one characteristic of 
the established neural network can be found that the neural network uses its historical 
information to calculate the output at current moment, which is also a typical feature 
of recurrent neural networks. 


124 6 RNN for Motion-Force Control of Redundant Manipulators ... 


0.5 0.5 0.5 
Initial as Initial 
staten 2 YW 
03 0.3 ff, 0.3 
E g g 
x = È 
0.1 0.1 0.1 
-0.1 -0.1 -0.1 
-0.2 0 02 04 06 08 0.2 0 0.2 04 06 0.8 02 0 0.2 04 06 08 
x(m) x(m) x(m) 
(a) (b) (c) 
0.5 0.5 0.5 
Final Final 
0.3 state Final 0.3 state 
= 0.3 state 
& 0.1 g i = o1 A 
> x $ Initial = A 
0.1 ale GA Initial 
-0.1 -0.1 state 
-0.3 -0.1 -0.3 — 
-0.2 0 02 04 06 08 -0.2 0 02 04 06 08 -0.2 0 0.2 04 06 08 
x(m) x(m) x(m) 
(d) (e) ®© 


Fig. 6.8 Time history of robot's joint configurations in fixed point control from different initial 
joint angle @. a 6o = [0.9, —0.75, —1.5, —1.6]" rad. b @ = [1.8, —0.3, —1.6, 0.6]" rad. € 6) = 
[1.9, —1.5, —1.6, —0.6]" rad. d @ = [0.5, —0.5, —1.6, —0.6]" rad. e 6) = [0.7, —0.3, —2, 0.2)" 
rad. f 6) = [0.3, —1.5, —1.6, —0.6]" rad 


Q2: “As described in Section II, the matrices Xp and X 'f are crucial in the 
controller design, however, the authors didnt show the details. How to obtain those 
matrices in actual applications requires detailed description.” 

Answer: Xp and X f are used to realize the decoupling of the contact force and 
tracking error of the end-effector. When the contact surface is known, the combination 
of Xp, >» f and S, enables the normalized description of the control tasks. 

Q3: “Limited stiffness of the manipulator elements can lead to state variables 
oscillations. Have you observed such work of the object?” 

Answer: The limited stiffness of the manipulator elements can lead to state vari- 
ables oscillations. In this chapter, the QP type formulation is obtained based on static 
modeling method, and inertial force is not taken into account. The condition of the 
modeling method is that the process is quasi-static. In other words, the relative motion 
of the end-effector and the workpiece is very slow. In the experiment tests, we also 
found that some oscillation would occur if some parameters are appropriately tuned. 
In this case, a damping coefficient can be introduced to handle the oscillations. 

Q4: “In real manipulator significant issue is related to control of electric drives. 
In mentioned structures, internal control loop, related to torque control, introduces 
some delays for external speed controllers. Have you considered such problem?” 

Answer: In this chapter, we mainly focus on projection RNN based controller 
design in kinematic level, and the control command is selected as joint velocity 
signals. Therefore, we assume that the robot controller can provide an ideal response 
to the joint velocity command. Although the delay is unavoidable for real systems, 
when the control frequency is set as 100 Hz, experimental results could show the 
effectiveness of the proposed controller. From Eq. (23), it can be observed that the 
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force control is realized by adjusting the joint velocities base on the RNN, which is 
consistent with the idea of admittance control. In our experiment, the velocity control 
in the inner loop is done by the robot controller, and we assume that it provide an 
ideal response to the joint velocity command. It is remarkable that the uncertainties 
in the dynamic level such as friction and disturbances do affect the performance of 
position-force control in the outer loop, but these uncertainties can be suppressed by 
the closed-loop control mechanism of the controller itself. 

Q5: “Could you explain real impact of projection operator PR on work of the 
control system?” 

Answer: The projection operator Pg plays an important role in guaranteeing the 
bounded-ness of the output of neural network i.e., the boundedness of œw can be 
ensured introducing Po. As described in Eq. (17), based on escape velocity method, 
both the boundedness of joint angles and velocities are guaranteed. 

Q6: “RNN uses delays during data processing, so the calculation step size seems 
to be important for overall work. Have you considered such issue?” 

Answer: We did consider this problem. The faster the RNN calculates, the better 
performance can be achieved. But at the same time, it would also lead to a increase 
of computational burden, which may make the system unstable. In our experiment, 
the control period is set to be 10 ms. 


6.7 Summary 


This chapter focuses on motion-force control problem for redundant manipulators, 
while physical constraints and torque optimization are taken into consideration. 
Firstly, tracking error and contact force are modeled in orthogonal spaces respec- 
tively, and then the control problem is turned into a QP problem, which is further 
rewritten in velocity level by rewriting objective function and constraints. To han- 
dle multiple physical constraints, an RNN based scheme is designed to solve the 
redundancy resolution online. Numerical experiment results show the validity of the 
proposed control scheme. Before ending this chapter, it is noteworthy that this is 
the first chapter to deal with motion-force control of redundant manipulators in the 
framework of RNNs and redundant manipulators with force sensitivity, e.g., grind- 
ing robots, can be readily controlled with the proposed RNN model but cannot with 
existing RNN models in this field. 
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